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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4047 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AAGCTTTTTG CTTTCCTTCC CCGGGAAAGG CCGGGGCCAG AGACCCGCAC TCGGACCAGG 60 

CGGGGGCTGC GGGGCCAGAG TGGGCTGGGG AGGGCTGGGA GGGCGTCTGG GGCCGGCTCC 120 

TCCAGGCTGG GGGCCGCCAG CTCCGGGAAG GCAGTCCTGG CCTGCGGATG GGGCCGCGCG 180 

TGGGGCCCGG CGGGGCGGCC TCGGGAGGCG TCCAGGCTGC GGGAGCGGGA GGAGCGGCCG 240 

TGCGGGCGCC AGCGCCGTGG GTGGAGGTCG CCGTCCCTCC TGAGGGGCAG CCAGTGCGTT 300 

TGGGACCCGG GAGCAGAGCC CGCGCCTCCC CAGCGGCCTC CCCGGGGGTC TCACCGGGTC 360 

ACCCGAGAGC GGAGGCCCCG GCTCCGCAGA AACCCGGGGC GGCCGCGGGG AAGCAGCGCC 420 

CTCAGGCGTC GGAGGAGCCC CCAGAAGGAC CTCGCGCCTT CCCGCCGGGC TCCGACCGCC 4 80 

TGGGTTCGGT GCGGGACGGC CCAGGCCGCC AGGACCCCCA AGCGCAGCTC AGTCTGCGGG 54 0 

GCACGACCCA GAGGCCAGCA GCAGAGGACG GGGCCGGGGC CGGGAGAGGG CGGGGAGGGC 600 

n GCTCCTGGGA GGTCAAGGCC AGGGCTAGAC TTTCAGGGTC ATGGCCTGGC CCCTCATCCC 660 

~i CAGGGAGGTG AGGGGGCTCT GTGAGCAGAG GGGGCCCCGG TGGAGAAGGC GCTGCTAGCC 720 

/H AGGGGCGGGG CAGGAGCCCA GGTGGGGACT TAAGGGTGGC TGAAGGGACC CTCAGGCTGC 780 

^jj AGGGATAGGG AGGGAAGCTA GGGGTGTGGC TTGGGGAGGT GCTGGGGGAC CGCGGGCGCC 840 

3 CTTTATTCTG AAGCCGAATG TGCTGCCGGA GTCCCCAGTG ACCTAGAAAT CCATTTCAAG 900 

O ATTTTCAGGA GTTTCAGGTG GAGACAAAGG CCAGGCCCAG GTGAAAATGT GGCAGTGACA 960 

ly GAGTATGGGG TGAGAACCAC GGAGAGAGGA AGTCCCCGAG GCGGATGATG GGACAGAGAG 1020 

Ul CGGGGACCAG AATTTTTTAA AACGCATCTG AGATGCGTTT GGCAGACTCA TAGTTGTTTT 1080 

£ CCTTTCACGG AGAAAGTGTG GGCAGAAGCC AGCTCTAAAG CCCAGGCTGC CCAGCCTGCA 114 0 

5 = CTGGCAGAGC TGACGGAAGG CCAGGGCAGA GCCTTCCCTC CCTGTCACAG ACATGAGCCC 1200 

L TGGAGATCTG GAATGAGGCA GATGTGCCCA GGGAAAGCTG ATCCGCCCCG ACCCAGGGCC 1260 

^ CCCCGGGTGC CCCTTTGAGC GTGGAATCGT TGCCAGGTCA TGGCTCCCTG CTATCGAACA 1320 

I a CCGGACACGG GTCGTGTGCT GCACCTGGCA GTTGCAGGAC CGACACCCAC AATGCCTTAA 1380 

GAGGTGATGA CTGCCTTCCA GGGGCCTGGC TGGCTGACAC TTTGCATGGC TCCTGGAGAA 14 40 

W GAGGGATTGA GTGGAGTCCA CGGGTCATGG CCACGTCCTG GGTGCTGCCT CTGAGGCAGG 1500 

O GCCCGGCTGG GGTGAGAAGG GGCTGGAGAC AGGTTCCTGC CAGTTCAGCC TCTAACCGGT 1560 

P GGTCTTCATG CCTAGGAACC CACTGGGGGC TTATGAAACT GCAGGTGGCT GAGTCCTTGC 1620 

CATGGGGTCT CTCCTTCAGG AGGTCTGGGT GGGGCCGGAG ACTGTACCCC ACAAAGGGTC 1680 

CCAGGTGAGG CGGATGTGGC CTGGCGCTGT GTGGCTCTGG ACCTAGTCCT TGGGCTTGGG 17 4 0 

CTGGCGCCCA GGGCCTGGGC TTGAGACAGC TGTGACGCAG GCAAGCCATT TACCCCGTTT 1800 

GTGGGGACAT TACATCTTCC TAGCTTGGAA CACACAGGCA GCCAGGGTTG TTATCCACAT 18 60 

TCCTCCTCCA TGTTCTTCTC TTGAGAACTT TTACCAGGTA TGTCAGGAGC TGGGCTCCAC 1920 

CAGGGAGACT CAAGTGGAAA GCCCTCATCC TTGTCCTCCA GGAGACAGGA AAACCTATGG 1980 

TTACAATTCC AGGGACAAGA GCGATGCATG TGAGGTGTGG CAAATCTCAC TGTTCAACTG 204 0 

GAGAAATCAG AGACAGCTTC CTGGAGGCAG TGACACCTGG ACAGGCTTCT CCACAGGAGG 2100 

- AAGCGAGTGA GAGAAGCCAA CTGGGATGGA CCCATCATGT AGGGGGAACA GTGCGCGCAG 2160 

AACCAACAAC CACCCCCACC CTAGGCCCAG AGCTCACGGA GAGAGCTGGG CCTCTCGGGG 2220 

TGACTACATA GTTCCCTGCT GGATCTTAGG TCTTGTCCTT GGGCAGCTCT GCTGAGACCT 2280 

CTATGCCTGT TCCAGGCTGC ACCAAGGTTT TGTGACTATT GGTCTGGGGT TGTTTTGCAG 234 0 

CAACTGAAGT GTTCTGTTGT AAAACAGGCA CTTGATTTGC TGGAAGGAAT GCTGTTTGTT 2400 

CTTGCTGCGA CAAACATTGA GCAGCATTTA GTGGGCGGTT TATATCTTGT GGAGTAATGG 24 60 

GTGTTTTTGA AGTCTGTCCT GGGTACTGCA CATTAAAAGG AATATCATTT TCTGAAACAT 2520 

TGCTATTTTC CACACCAGAA ATCATATCCT CTTGCTGGTC CATGTCTGAA GACCTTACAC 2580 

GAGAAAGTCT TAATGTAAGT TTAGTAGAGT CCTTGGATGG AGAACTAATT ATATCATACA 2640 

TTGCCGCTTT CTCACTCTGC TCTTTTTCAT CCTTGCCTAA TTTCATTTTC TTCTGCTTCT 2700 

TTTGTTTTCT TTCTGGAGAA TCTAGCAAGA TATCTGGTGG AACATCTCGA GGTGATGAAC 27 60 

AAGGTAGAGA CTGAGATTGT AGGATTAAAG GTGGTCTTGA GCCTTTAGGA GTTCCTTCAC 2820 

TTCCAGCAGG GGAGCATACT GGCTGTGGAG ATCTCAAGGG AAAAGATGCA GCATTCCTCA 2880 

TTGTTGAAGA ATCTCCATCG TCACTACTTA GCCTGTGCAC CATGTGTAGG TAGTCCTCAC 2940 

TTGAACCATG TCTAGGATTA TCAGCATGAT GATTAGCTGA ATTGCCAGAC AACGGACCAG 3000 
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u 



m 



AAACTTTATT 
CTTGTTGCAT 
CAGGGCTTTG 
TCCCAGAGCT 
GTTGATAATT 
ACATCATCAT 
GACTCCTTGC 
CAAGGTTATC 
GTGAAACCAA 
CATTAAAGAG 
TCAGGAGGTC 
CCCCATTCAT 
AGCTGTCATT 
GGCCGAGGCC 
CCTCTTCCGT 
CAAATCGTTG 
ACCCGACCAC 
CTGTCGGCGG 



ATCATGTATG 
CTGTGGATGT 
TGGGGCGTAT 
TGTCTGTGGT 
GGAAGATGCA 
TCCACTTTGT 
CAGGACGGCC 
TTTCAACTCT 
ATTGTCATCC 
AAGGCTCTTT 
TGTGAGACTA 
CCTGAATTTC 
ACTTCCCATT 
GCTGTGGCGA 
CTCCTCAGTG 
CTTCTTCTTC 
GGGCCCGAGC 
GGGCGTCCTT 



TTTCTCAAAC 
GTTGTGTAAC 
GGACTAGGCA 
GGCACAAACC 
GGACTACTGT 
ACATATCTGT 
TGCAACAAGA 
ATGTGATCTG 
CTACAAGCCA 
GTAGTTGTAG 
GCAATCCCCG 
TGGAATGGTG 
TCCCAAACAC 
GAGACTATAA 
CGGGGAACAT 
CTCCTCCTCC 
TACCGGGGGG 
TGGATCC 



CACCTGCAAC 
TTGAAGGATG 
CTGGGCTATT 
GGCTGGAGGG 
GCATGGAATT 
TCTGCATGCT 
CTGGTATGTC 
TTGATACCTG 
AAAGGCAGTT 
CAGGTAAAGG 
CAAGAGTAGT 
TTGCCTATAA 
TGGGCGAATC 
TCCGGGCCGG 
GTAGAGCCGG 
TCCTTCTCCC 
GCATCGCCGC 



AATGGGACTT 
GGAATATGGC 
TTGCTGTGGC 
GCTATGTGAG 
CTGAGAAAGT 
TTTCTCCCTG 
ACCTTCTGGG 
GTTGAGGCTA 
CACCTCTTCT 
AGATGGAAGA 
AATGGGGACA 
AAGTACTTAG 
GGCGTCTGAA 
GAGGGGGGGC 
GGGGAGACCA 
ACATAGAAAC 
GGGCCCGGGA 



GATACCGATG 
ATGTATCCTG 
ATAAATCTGT 
ATAGTGGTTT 
TTATACTGAG 
AAAACATTAG 
TCATCACTGC 
TGGACAAGCT 
GCTATTCGTG 
GGCAGCTGGT 
TGGGGCATAT 
TTCAGGTGCC 
TCCAAGGGGA 
GGCTACGGCT 
GCCGAGAAGA 
ACTCACAAAC 
ACCAATTCTC 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4047 



Mi 
U 



(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:2; 



GGATCCAAAG 
CGCGCGGCCG 
ACCGGGTCAG 
CGGACCCCGC 
TTCTACACCT 
TTTCCCCGCT 
GGGACAGTGG 
TGGCTACCTT 
TCACTTTGAC 
TTCGCGATGC 
CGGCTGCTAG 
CCGCGGGGGG 
CCGAAGAAGA 



GTCAAACTCC 

GGCGCTTGGC 

TGAAAAAACG 

CCCGGGCCCC 

CTCATGTCTC 

GATTCCGCCA 

GAATCTCGTT 

AAGAGAGTCA ' 

ATTCAGAGCA 

TTTGTTTTAA 

GCGCCGGCCG 

GACCGGGCCG 

AGGGGGAAA 



CCACCTGGCA 
GCCAGAAGCG 
AT C AG AG TAG 
TCGCGGGGAC 
TTCACCGTGC 
AGCCCGTTCC 
CATCCATTCA 
TAGTTACTCC 
CTGGGCAGAA 
TTAAACAGTC 
AAGCGAGGCG 
CGGCCCCTCC 



CTGTCCCCGG 
AGAGCCCCTC 
TGGTATTTCA 
ACCGGGGGGG 
CAGACTAGAG 
CTTGGCTGTG 
TGCGCGTCAC 
CGCCGTTTAC 
ATCACATCGC 
GGATTCCCCT 
CCGCGCGGAA 
GCCGCCTGCC 



AGCGGGTCGC 
GGGGCTCGCC 
CCGGCGGCCC 
CGCCGGGGGC 
TCAAGCTCAA 
GTTTCGCTGG 
TAATTAGATG 
CCGCGCTTCA 
GTCAACACCC 
GGTCCGCACC 
CCGCGGCCCC 
GCCGCCGCCG 



GCCCGGCCGG 
CCCCCGCCTC 
GCAGGGCCGG 
CTCCCACTTA 
CAGGGTCTTC 
ATAGTAGGTA 
ACGAGGCATT 
TTGAATTT^T 
GCCGCGGGCC 
AGTTCTAAGT 
CGGGGCGGAC 
CCGCCGCGCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
739 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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CAAGAGTGGC GGCCGCAGCA GGCCCCCCGG GTGCCCGGGC CCCCCTCGAG GGGGACAGTG 60 

CCCCCGCCGC GGGGGCCCCG CGGCGGGCCG CCGCCGGCCC CTGCCGCCCC GACCCTTCTC 120 

CCCCCGCCGC CGCCCCCACG CGGCGCTCCC CCGGGGAGGG GGGAGGACGG GGAGCGGGGG 180 

AGAGAGAGAG AGAGAGAGGG CGCGGGGTGG CTCGTGCCGA ATTCAAAAAG CTT 233 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGATCCAAAG AATTCGGCAC GAGGTAGTCA CGGCTCTTGT CATTGTTGTA CTTGACGTTG 60 

p AGGCTGGTGA GCTTGGAAAA GTCGATGCGC AGCGTGCAGC AGGCGTTGTA GATGTTCTGC 120 

J CCGTCCAGCG ACAGCTTGGC GTGCTGGGCG CTCACGGGGT CCGCATACTG CAGCAGGGCC 180 

%.j TGGAACTGGT TGTTCTTGGT GAAGGTGATG ATCTTCAACA CTGTGCCGAA CTTGGAGAAA 24 0 

r5 ATCTGGTGCA GCACATCCAG GGTCACAGGG TAGAAGAGGT TCTCCACGAT GATCCTGAGC 300 

S ACGGGGCTCT GCCCGGCCAT CGCCATCCCT GCATCCACGG CCGCCGCCGA GGCAGCCAAG 360 

~j GCCAGGTTCC CCGACTGGAC CGAGTTCACC GCCTGCAGGG CCGCCTGGGC CCGCGCCTGG 420 

JXJ TTGGGAGAGC TGTCGGTCTT CAGCTCCTTG TGGTTGGAGA ACTGGATGTA GATGGGCTGG 480 

4] CCGCGCAGCA CAGGGGTCAC CGAGGTGTAG TAGTTCACCA TGGTATTGGC AGCCTCCTCC 540 

=P GTGTTCATCT CGATGAAGGC CTGGTTTTTC CCCTTCAGCA TCAGGAGGTT GGTGACCTTC 600 

a CCAAAGGGCA GCCCCAGGGA GATGACTTCC CCCTCCGTGA CGTCGATGGG GAGCTTCCGG 660 

M ATGTGGATCA CTCTAGAGGG GACGCCTGCA CTTCGGCTGT CACCTTTGAA CTTCTTGCTG 720 

H= TCATTTCCGT TTGCTGCAGA AGCCGAGTTG CTGCTCATGA TAAACGGTCC GTTAGTGACA 780 

iU CAAGTAGAGA AAAGCTCGTC AGATCCCCGC TTTGTACCAA CGGCTATATC TGGGACAATG 840 

Ut CCGTCCATGG CACACAGAGC AGACCCGCGG GGGACGGAGT GGAGGCGCCG GAATCCTGGA 900 

S GCTAGAGCTG CAGATTGAGT TGCTGCGTGA GACGAAGCGC AAG TAT GAGA GTGTCCTGCA 960 

S GCTGGGCCGG GCACTGACAG CCCACCTCTA CAGCCTGCTG CAGACCCAGC ATGCACTGGG 1020 

U TGATGCCTTT GCTGACCTCA GCCAGAAGTC CCCAGAGCTT CAGGAGGAAT TTGGCTACAA 1080 

TGCAGAGACA CAGAAACTAC TATGCAAGAA TGGGGAAACG CTGCTAGGAG CCGTGAACTT 1140 

CTTTGTCTCT AGCATCAACA CATTGGTCAC C AAG AC CAT G GAAGACACGC TCATGACTGT 1200 

GAAACAGTAT GAGGCTGCCA GGCTGGAATA TGATGCCTAC CGAACAGACT TAGAGGAGCT 1260 

GAGTCTAGGC CCCCGGGATG CAGGGACACG TGGTCGACTT GAGAGTGCCC AGGCCACTTT 1320 

CCAGGCCCAT CGGGACAAGT ATGAGAAGCT GCGGGGAGAT GTGGCCATCA AGCTCAAGTT 1380 

CCTGGAAGAA AACAAGATCA AGGTGATGCA CAAGCAGCTG CTGCTCTTCC ACAATGCTGT 14 40 

GTCCGCCTAC TTTGCTGGGA ACCAGAAACA GCTGGAGCAG ACCCTGCAGC AGTTCAACAT 1500 

- CAAGCTGCGG CCTCCAGGAG CTGAGAAACC CTCCTGGCTA GAGGAGCAGT GAGCTGCTCC 1560 

CAGCCCAACT TGGCTATCAA GAAAGACATT GGGAAGGGCA GCCCCAGGGT GTGGGAGATT 1620 

GGACATGGTA CATCCTTTGT CACTTGCCCT CTGGCTTGGG CTCCTTTTTC TGGCTGGGGC 1680 

CTGACACCAG TTTTGCCCAC ATTGCTATGG TGGGAAGAGG GCCTGGAGGC CCAGAAGTTG 1740 

CTGCCCTGTC TATCTTCCTG GCCACAGGGC TTCATTCCCA GATCTTTTCC TTCCACTTCA 1800 

CAGCCAACGG C TAT G AC AAA ACCACTCCCT GGCCAATGGC ATCACTCTTC AGGCTGGGGT 1860 

GTGCTCCCTG ACCAATGACA GAGCCTGAAA ATGCCCTGTC AGCCAATGGC AGCTCTTCTC 1920 

GGACTCCCCT GGGCCAATGA TGTTGCGTCT AATACCCTTT GTCTCTCCTC TATGCGTGCC 1980 

CATTGCAGAG AAGGGGACTG GGACCAAAGG GGTGGGGATA ATGGGGAGCC CCATTGCTGG 2040 

CCTTGCATCT GAATAGGCCT ACCCTCACCA TTTATTCACT AATACATTTT ATTTGTGTTC 2100 

TCTAATTTAA AATTACCTTT TCATCTTGCT TGATTTTCCT TCAGCTAAAT TAGAAATTTG 2160 

TAGTTTTTCC CCTAAAAAAT TCAATGGCAT TCTTTCTTAT AAATTACATT CTCTGATTTT 2220 

CTTGTCAGCC TGCTTCAAGG AAATCCATGT GTTCAAAATG CTTGCTCGCA GTTTGCTCCA 2280 

TACCAAATGG TTGCTTAACC CAAATATCTG AGCAGCAAAT TGAGCTGATC CTTCTGGAGA 234 0 

AAGTACGGTT GAACAGCCAA GACCACTGGG TAGTCGAAGA GAAGACCACA CATCCTGAAC 2400 
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TCCCCAGTCT GGTGTGAGGG GAGGACAGCT GATAACTGGA TATGCAGTGT TCCCAGACAT 24 60 

CACTGGTCCC AAACCATTAC TTCTGCCTGC CACTGCCACA AATACAGTAG GAATGCCATC 2520 

CCCTTCATAC TCAGCTTTAA TCCTCAGAGT TTCATCTGGT CCTTTATGCG CAGATGTTAC 2580 

TCGAAGTTCA CATGGAATGC CAAAATTTCC ACAGGCCTTC TTGATTTTTT CACAGTGACC 264 0 

AAGATCAGAA GTAGAGCCCA TCAACACTAC AACCCTGCAC TGACTTTCTG ATTTCAAAAG 2700 

CAACTCTACT CTCTCTGCAA CCCACTCAAA GTTTTTCTTT ACCATTTGGA GCCCTTCAGG 27 60 

AGTTACTTCT TTGAGGTCCC GATAAGACTG TTTGTCTTTC TGTTGGCTTC GATCTCCTGA 2820 

TGGCCAGAGT CTCCAGGAAT CATTGTCAAT AACATCAGCA AGAACAATTT CTTTGGTGGT 2880 

TACATCAACA CCAAATTCAA TCTTCATATC AACCAGTGTA CAATTCTGGG GCAACCAGGA 2 94 0 

TTTCTCCAGT ATTTCAAATA TAGCCTGTGT AGCATCTCGT GCCGAATTCA AAAAGCTT 2 998 



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4152 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

Q 

(ii) MOLECULE TYPE: Genomic DNA 

N 

Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

o 

gj AAGCTTTTTG TGAAAACCCT AGGATATGTC CCCTCCCTCA CCACACCCAA CCCCCCGCCC 60 

zp CTGCCCCAGG ACATGACGAT GCCTCACACA CACACACACA CACACATACA CACAAGGCCG 120 

^ TGAGCTGCAC GCAGGAACAT GGGCTGCACT CACGACAACA TTGAAAAAAT ATACATTATA 180 

^ TATGTACACC CGGGGCCCCC ACGTCCCCTC CCGTCCCCGC AGCCTGGCCA CACCAGGTCA -24 0 

f 8 CGGAGGAGGG GCCGGGGCTG CAGGACCTCA GGACTGCAAG GGCAGGAAGG GAAACAGGAC 300 

p AAGAAAGGAA GGAAGTTGGA AAGGAGGGAG AAATGGGGTC CCCAGACTGA AATGGAAATG 360 

AGGTGGGGCG ATCATAAGAG AAGCAGGGAC GATGGTCCAG CTGAGGGAGC CCTGCAGAGG 420 

N 5 GGGAAAAGCT TCCCATGGAC AGGAGAGAGA AGGGAAGGGG AGAGGAGAGG GTTTCCTTCA 480 

Ul ATCCCACCCC CAGCCCCAGC CCCAGCCCCA GCCATTGCAA TCGTCACCCT CTCCCCAACA 54 0 

O CAGTGAGTGC TAAGGGGGCA GCTGCCATTG GGGGTAGAAA GGCAGCTGAA GTCCAGCCCA 600 

Q CTTTCCAACC CAGCCAGCCC CAGTGCAAGG GGCACACCAG GAGCATGACA GCCCAGAAGT 660 

GAGGGATGGG GGGCCGGGGG AGGGGCAGGG CGGACTCCAG AGGGCCCGCT GGGGTTTTGA 720 

AATGAAAGGA GGACTGGTTC TGAAGCCTCT CTCCCTCTTG GTCTCTGTGT TCCCAGAAAG 780 

TCCTTCTCCC ATGTCTGGAG TGTCTGTTTC ACCAGGGCAG AATTCCCCCT CTGCGTGGGG 84 0 

AGAGGTGTAG GCCTTAGTAG CGGTGTGGGG GGGTCTCGAT GATGCGTCTC TCGTCGCTGC 900 

TGGGGGAATC GGCCACCTCC GAGTCACTGC TGTCCTCATC CTCCTGCTGG CCCCCAACAG 960 

CCCCCGTCAC ACAGGACTGC CGATTCTGGT AGGACTCCAT GGGGTTCACA ATGATGGTGA 1020 

GAGCTGAGTC ATCCCAGAAG AGGTCTGGGT CCTTGGGGTC ACTGGAGGCC CCTGGAGGCC 1080 

CGCCGGCCCC TGAGACGCGG CGGTGAAGGG AATGGATGCG CACCAGGCCC AGGACGACCA 1140 

- TGAGCACCAG GAAGCCCACG CACACCACAA TGATGAGGGT TGCGGCGCTG GGTATCATGG 1200 

AGTTTCTGTG GGAGCTGGCT AGGCTGTGTC CAGCCATCTC AGGCGGGGGC TGGTGACCAC 12 60 

GGTGCAGGAA CTGCTGGGAG CTGAGCACGT GGCTGGGGTG GGCAACCCGG TTCATGCTGT 1320 

GCAGGACATT GACCTCCACG ATGAATTCAT TGCTGGAGTA ACGGCCATTC ATTTCCGAGC 1380 

AGGAAAGCCG GAACTTCCTG GTGTAGAGGG CAGCTCCGTG TCGCAGCCGA TAACGAGCCT 14 4 0 

GCCTCAGGAT CTCTTCATAC ACAGTGATGC TCTCCACCCC AGCAATAGTG AGGTAGGCAG 1500 

ATGTGTTGGT GAGCTCCAGC CCCCGCTGCT GCAGAGAGGT TGTGTCCAGG AGCAGGCTTT 1560 

CCCGCTCGGG ATCCAGGTCA TCCCCCACCA GAGAAATTTC ACAGCCATCC AGGTTGTGCA 1620 

CAATCTCATC CGACATGCGT GTGTCTGTCA CTGTGCCCTG CCAACTCTCA TCCTTTTTGG 1680 

CCTCCACCTG GTGAGAAATG GAGCAGGTGA TTTGAAGATC AGGGAACAAA GGGACGCCGT 17 4 0 

TGGTTCCCTC AAAGTCCACA GCTGGGCGGG CAAAATGAGC AGTGCCACTC AGCAGGATCT 1800 

GGGGGGCGTC AGGCTGAAGG ACGACCACGT AGCCCTCCAC TTCAGGGATG GAGACGCAGG 18 60 

ACTCTTCGCT GAAGCACTTG ACAGCAGTGG TGAGGCGCAG GGGCCTGACG CCGGGCGTGG 1920 

CAAAGCGCAG AGTGTTCATG TAAGCCACAT GCTGCAGGGC ATGGTTGAAG GTCTCCACAT 1980 

CATCCCCCTC CAGGGTGAGC AGGGACTGTG AGGGGTTCAC GTGGACCTTC ATGCCTTTGC 204 0 
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CCAGGCTCTC GAAATCCCTA TAGTCCAGCC 
CCTCGCGGCT CTCCAGGCGA CCTGAGCGCA 
AGTGGTGGAT CGACAAAGGG TCTCCTTGGG 
TCTCTTTGTT CTTCTCCTCA GTCCAGCAGG 
GTGGGTGGAT GAGGCCATTG TCATGGATGA 
AGAGTGTGAC TGTGGGGAAC TCGAGGTTCA 
CCTGCTCCAG CTTCCAGAGG AACTTGACTG 
GGAAGGCAAT CCTACAGCCG TGGACAGTCA 
GGACAGTGTT ACATACGATG GTTTCCTCTT 
GCTTCATCCA GAAGGACAGG GTGAAGTGGT 
GCCCACTGGG GCCACCCAGG GGCACCTGCA 
GGCTGCTGTC CTGGCTGTAG TGCACCGAGA 
GCATGGGCAA CAGATCCACT TCCCCAGTGG 
CTGAGTAGTT GTCACGGTCA CAGCCCTTGG 
CCTGAATGTT CCAGAGTGGT TCATCACAGG 
AGCTCCCAGC ACCTGGTGCA TATTCGATCC 
AGGTGGGCTT CACCTGAATC TCCACCTCAG 
CATAAGCTGT CACTGTAAAC TTATAGAGCC 
^ TCTCAATGTT CCCGTCATTG TCAATGAGGA 
~ AGCAGATCTG GCTGTACTGG GGGGAGCAGT 
GATCGTACAG CTTCCCCTCT GTCACAGCCG 
NJ CAAACTCGTT CACATCGTTG ACCCGCACAT 
Q TGGCCCCGTC GGGGCCCTCG CCACAGTCAT 
O GGGCCTCGCA GTCCACAGGC TCCTTGGCCC 
yj GGATCACAGC CTCAAAGGGC ACCCCAGACC 
IJl CATAGCGCAG CGGGGCATCC TTGTCCAAGG 
7* TGTCATTCTC CAT G AC GAT G CCCTGGTACT 
CTTTGTTACA GGAGCAGGAC GCGAGCAGAG 
TCATGGTGCG GCGTGGGGCA GGGCAGGGCC 
!"7 CTGCGGATTC CACCTTGCGG GAGGGATACA 
f 4 " AAATTGTGTA GGAGGAGTCC AGCTTAGGAC 
GGGCCTCTGC AGGTTCAGAG GATCACTGCT 
W TTTGCCATGG CCTTGATTGC AACAGCTGCC 
O ' AGGATCTCTT CTCCACAGTC GTACTTCTGC 
Q AGACGAAGGT CCTCTCGTAC CTCCCCGCTG 
ATCTTTGGAT CC 

(2) INFORMATION FOR SEQ 




CCTCCCGACA TGCATAGAGG CACTCGATGA 2100 

CGCTGAAACC AGCCAGGTAG CCATGGAAGT 2160 

TGGTGTCTGT ACTGTTGTCT CCCTTTTCCT 2220 

CCCCAATCAT GAGAGCAGGC TCCCTTCGGG 2280 

GGGCAGGGTC GAAGGAGATG CCGTCGGTAT 234 0 

GAGCGTAGTG GTGCCACTCA T CATC AC AG A 24 00 

GGCGGGCACT CTCAAGCAGG GGCCAGTAGA 24 60 

GCGAGTAGTG AGAGAAGCCG TCCTCATTCT 2520 

CCTTCTTGCC CTTGTTGGGA GTTACGCCAT 2580 

CACTGAGGCT GTCCTGGGGC CCAGAGCCCA 264 0 

CAGCCTGGGT GCCATTGAAC CAGTAGATCA 27 00 

GTCCTGCTGT CCAGTTGGCA TTGGGGCCAG 27 60 

CAGCACCACA GAGTTTCCGC AGCGCCCGCT 2820 

CCACATGGCT GGTCTGCAGC TCTATGGTGG 2880 

TCTCCAGGCG GATACCAGGG AACAAAGCCA 294 0 

TTTTGTTCCA GCCTTGCCAG CTGGGTTTAC 3000 

CATCATCTGC TGCCCGCTTC TTCCCACAGT 3060 

TCTCACCACT GTACTGCAGC TTCTCTGTGT 3120 

AAGGGGTGTT GGGTGTGAGA ATCTCATAGT 3180 

CACCGTCAAT GGCTTCCACC CGCAGGATGC 324 0 

CACGATACAG CCGTTCCACA AACACTGGGG 3300 

GCACAGTGGC CTTGTGGGAC TTCTTGGTGT 3360 

AGGCCTGGAT GGTGAAGGTG TGTTCCTTCT 34 20 

GGATCAGCCC CTCTCCTGTC GCCTTGTCAA 34 80 

CATGGAGCCG GAAGCCGCAG ATCTCACCTG 354 0 

CAAAGAGTGG TGGATTCAGT AGGACCGTGT 3600 

CTGCCTCAAT CCATGGCTTG TGCTTGTTGG 3660 

AGGCCAGCAG AAGGGGCAGC AGCAGGAGGG 3720 

AGGCGTTTGC CTCCCCTGGG AGCCTCCAGC 3780 

GGGGGGGAAA ACCAAAATAA AACGTCAAAT 3840 

CGGGCCAGAG CCAGGCCAGG CTCGGGGAGG 3900 

GCCACCACCG CCACCCTGGG AGCCAGTTAT 3960 

TCCTCTGTCA TGGCAGACAG CACCGTGATC 4 020 

TCAATCTCCT TGCCAAGGTC TCCCTCAGGG 4 080 

TCCTGGAGCA GTGATAGGTA CCCATCCTGG 414 0 

4152 

ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ -ID NO: 6: 



GG AT CCAAAG ATTCGGCACG AGTGGCCACA TCATGAACCT CCAGGCCCAG CCCAAGGCTC 60 

AGAACAAGCG GAAGCGTTGC CTCTTTGGGG GCCAGGAACC AGCTCCCAAG GAGCAGCCCC 120 

CTCCCCTGCA GCCCCCCCAG CAGTCCATCA GAGTGAAGGA GGAGCAGTAC CTCGGGCACG 180 

AGGGTCCAGG AGGGGCAGTC TCCACCTCTC AGCCTGTGGA ACTGCCCCCT CCTAGCAGCC 240 

TGGCCCTGCT GAACTCTGTG GTGTATGGGC CTGAGCGGAC CTCAGCAGCC ATGCTGTCCC 300 

AGCAGGTGGC CTCAGTAAAG TGGCCCAACT CTGTGATGGC TCCAGGGCGG GGCCCGGAGC 360 

GTGGAGGAGG TGGGGGTGTC AGTGACAGCA GCTGGCAGCA GCAGCCAGGC CAGCCTCCAC 4 20 

CCCATTCAAC ATGGAACTGC CACAGTCTGT CCCTCTACAG TGCAACCAAG GGGAGCCCGC 4 80 
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ATCCTGGAGT GGGAGTCCCG ACTTACTATA ACCACCCTGA GGCACTGAAG CGGGAGAAAG 54 0 

CGGGGGGCCC ACAGCTGGAC CGCTATGTGC GACCAATGAT GCCACAGAAG GTGCAGCTGG 600 

AGGTAGGGCG GCCCCAGGCA CCCCTGAATT CTTTCCACGC AGCCAAGAAA CCCCCAAACC 660 

AGTCACTGCC CCTGCAACCC TTCCAGCTGG CATTCGGCCA CCAGGTGAAC CGGCAGGTCT 720 

TCCGGCAGGG CCCACCGCCC CCAAACCCGG TGGCTGCCTT CCCTCCACAG AAGCAGCAGC 780 

AGCAGCAGCA ACCACAGCAG CAGCAGCAGC AGCAGCAGGC AGCCCTACCC CAGATGCCGC 84 0 

TCTTTGAGAA CTTCTATTCC ATGCCACAGC AACCCTCGCA GCAACCCCAG GACTTTGGCC 900 

TGCAGCCAGC TGGGCCACTG GGACAGTCCC ACCTGGCTCA CCACAGCATG GCACCCTACC 960 

CCTTCCCCCC CAACCCAGAT ATGAACCCAG AACTGCGCAA GGCCCTTCTG CAGGACTCAG 1020 

CCCCGCAGCC AGCGCTACCT CAGGTCCAGA TCCCCTTCCC CCGCCGCTCC CGCCGCCTCT 1080 

CTAAGGAGGG TATCCTGCCT CCCAGCGCCC TGGATGGGGC TGGCACCCAG CCTGGGCAGG 114 0 

AGGCCACTGG CAACCTGTTC CTACATCACT GGCCCCTGCA GCAGCCGCCA CCTGGCTCCC 1200 

TGGGGCAGCC CCATCCTGAA GCTCTGGGAT TCCCGCTGGA GCTGAGGGAG TCGCAGCTAC 1260 

TGCCTGATGG GGAGAGACTA GCACCCAATG GCCGGGAGCG AGAGGCTCCT GCCATGGGCA 1320 

GCGAGGAGGG CATGAGGGCA GTGAGCACAG GGGACTGTGG GCAGGTGCTA CGGGGCGGAG 1380 

TGATCCAGAG CACGCGACGG AGGCGCCGGG CATCCCAGGA GGCCAATTTG CTGACCGTGG 14 40 

CCCAGAAGGC TGTGGAGCTG GCCTCACTGC AGAATGCAAA G GAT GG C AG T GGTTCTGAAG 1500 

AGAAGCGGAA AAGTGTATTG GCCTCAACTA CCAAGTGTGG GGTGGAGTTT TCTGAGCCTT 1560 

Q CCTTAGCCAC CAAGCGAGCA CGAGAAGACA GTGGGATGGT ACCCCTCATC ATCCCAGTGT 1620 

£jj CTGTGCCTGT GCGAACTGTG GACCCAACTG AGGCAGCCCA GGCTGGAGGT CTTGATGAGG 1680 

SJ ACGGGAAGGG TCTTGAACAG AACCCTGCTG AGCACAAGCC ATCAGTCATC GTCACCCGCA 174 0 

□ . GGCGGTCCAC CCGAATCCCC GGGACAGATG CTCAAGCTCA GGCGGAGGAC ATGAATGTCA 1800 

Q AGTTGGAGGG GGAGCCTTCC GTGCGGAAAC CAAAGCAGCG GCCCAGGCCC GAGCCCCTCA 1860 

yj TCATCCCCAC CAAGGCGGGC ACTTTCATCG CCCCTCCCGT CTACTCCAAC ATCACCCCAT 1920 

ifi ACCAGAGCCA CCTGCGCTCT CCCGTGCGCC TAGCTGACCA CCCCTCTGAG CGGAGCTTTG 1980 

AGCTACCTCC CTACACGCCG CCCCCCATCC TCAGCCCTGT GCGGGAAGGC TCTGGCCTCT 204 0 

^ ACTTCAATGC CATCATATCA ACCAGCACCA TCCCTGCCCC TCCTCCCATC ACGCCTAAGA 2100 

jf B GTGCCCATCG CACGCTGCTC CGGACTAACA GTGCTGAAGT AACCCCGCCT GTCCTCTCTG 2160 

TGATGGGGGA GGCCACCCCA GTGAGCATCG AGCCACGGAT CAACGTGGGC TCCCGGTTCC 2220 

H» AGGCAGAAAT CCCCTTGATG AGGGACCGTG CCCTGGCAGC TGCAGATCCC CACAAGGCTG 2280 

h* ACTTGGTGTG GCAGCCATGG GAGGACCTAG AGAGCAGCCG GGAGAAGCAG AGGCAAGT GG 234 0 

yj AAGACCTGCT GACAGCCGCC TGCTCCAGCA TTTTCCCTGG TGCTGGCACC AACCAGGAGC 24 00 

Q TGGCCCTGCA CTGTCTGCAC GAATCCAGAG GAGACATCCT GGAAACGCTG AATAAGCTGC 24 60 

S TGCTGAAGAA GCCCCTGCGG CCCCACAACC ATCCGCTGGC AACTTATCAC TACACAGGCT 2520 

w CTGACCAGTG GAAGATGGCC GAGAGGAAGC TGTTCAACAA AGGCATTGCC ATCTACAAGA 2580 

AGGATTTCTT CCTGGTGCAG AAGCTGATCC AGACCAAGAC CGTGGCCCAG TGCGTGGAGT 2640 

TCTACTACAC CTACAAGAAG CAGGTGAAAA TCGGCCGCAA TGGGACTCTA ACCTTTGGGG 2700 

ATGTGGATAC GAGCGATGAG AAGTCGGCCC AGGAAGAGGT TGAAGTGGAT ATTAAGACTT 27 60 

CCCAAAAGTT CCCAAGGGTG CCT.CTTCCCA GAAGAGAGTC CCCAAGTGAA GAGAGGCTGG 2820 

AGCCCAAGAG GGAGGTGAAG GAGCCCAGGA AGGAGGGGGA GGAGGAGGTG CCAGAGATCC 2880 

AAGAGAAGGA GGAGCAGGAA GAGGGGCGAG AGCGCAGCAG GCGGGCAGCG GCAGTCAAAG 294 0 

CCACGCAGAC ACTACAGGCC AATGAGTCGG CCAGTGACAT CCTCATCCTC CGGAGCCACG 3000 

- AGTCCAACGC CCCTGGGTCT GCCGGTGGCC AGGCCTCGGA GAAGCCAAGG GAAGGGACAG 3060 

GGAAGTCACG AAGGGCACTA CCTTTTTCAG AAAAAAAAAA AAAAAAACAA AAAGCTT 3117 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3306 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
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W 



GAATTCGGCA CGAGGTCAGT TTCCTGTGGA ACACAGAGGC TGCCTGTCCC ATTCAGACAA 60 

CGACGGATAC AGACCAGGCT TGCTCTATAA GGGATCCCAA CAGTGGATTT GTGTTTAATC 120 

TTAATCCGCT AAACAGTTCG CAAGGATATA ACGTCTCTGG CATTGGGAAG ATTTTTATGT 180 

TTAATGTCTG CGGCACAATG CCTGTCTGTG GGACCATCCT GGGAAAACCT GCTTCTGGCT 24 0 

GTGAGGCAGA AACCCAAACT GAAGAGCTCA AGAATTGGAA GCCAGCAAGG CCAGTCGGAA 300 

TTGAGAAAAG CCTCCAGCTG TCCACAGAGG GCTTCATCAC TCTGACCTAC AAAGGGCCTC 360 

TCTCTGCCAA AGGTACCGCT GATGCTTTTA TCGTCCGCTT TGTTTGCAAT GATGATGTTT 4 20 

ACTCAGGGCC CCTCAAATTC CTGCATCAAG AT AT CG AC TC TGGGCAAGGG ATCCGAAACA 4 80 

CTTACTTTGA GTTTGAAACC GCGTTGGCCT GTGTTCCTTC TCCAGTGGAC TGCCAAGTCA 54 0 

CCGACCTGGC TGGAAATGAG TACGACCTGA CTGGCCTAAG CACAGTCAGG AAACCTTGGA 600 

CGGCTGTTGA CACCTCTGTC GATGGGAGAA AGAGGACTTT CTATTTGAGC GTTTGCAATC 660 

CTCTCCCTTA CATTCCTGGA TGCCAGGGCA GCGCAGTGGG GTCTTGCTTA GTGTCAGAAG 7 20 

GCAATAGCTG GAATCTGGGT GTGGTGCAGA TGAGTCCCCA AGCCGCGGCG AATGGATCTT 780 

TGAGCATCAT GTATGTCAAC GGTGACAAGT GTGGGAACCA GCGCTTCTCC ACCAGGATCA 840 

CGTTTGAGTG TGCTCAGATA TCGGGCTCAC CAGCATTTCA GCTTCAGGAT GGTTGTGAGT 900 

ACGTGTTTAT CTGGAGAACT GTGGAAGCCT GTCCCGTTGT CAGAGTGGAA GGGGACAACT 960 

GTGAGGTGAA AGACCCAAGG CATGGCAACT TGTATGACCT GAAGCCCCTG GGCCTCAACG 1020 

ACACCATCGT GAGCGCTGGC GAATACACTT ATTACTTCCG GGTCTGTGGG AAGCTTTCCT 1080 

CAGACGTCTG CCCCACAAGT GACAAGTCCA AGGTGGTCTC CTCATGTCAG GAAAAGCGGG 114 0 

AACCGCAGGG ATT TC AC AAA GTGGCAGGTC TCCTGACTCA GAAGCTAACT TATGAAAATG 1200 

GCTTGTTAAA AATGAACTTC ACGGGGGGGG ACACTTGCCA TAAGGTTTAT CAGCGCTCCA 1260 

CAGCCATCTT CTTCTACTGT GACCGCGGCA CCCAGCGGCC AGTATTTCTA AAGGAGACTT 1320 

CAGATTGTTC CTACTTGTTT GAGTGGCGAA CGCAGTATGC CTGCCCACCT TTCGATCTGA 1380 

CTGAATGTTC ATTCAAAGAT GGGGCTGGCA ACTCCTTCGA CCTCTCGTCC CTGTCAAGGT 14 4 0 

ACAGTGACAA CTGGGAAGCC ATCACTGGGA CGGGGGACCC GGAGCACTAC CTCATCAATG 1500 

TCTGCAAGTC TCTGGCCCCG CAGGCTGGCA CTGAGCCGTG CCCTCCAGAA GCAGCCGCGT 1560 

GTCTGCTGGG TGGCTCCAAG CCCGTGAACC TCGGCAGGGT AAGGGACGGA CCTCAGTGGA 1620 

GAGATGGCAT AATTGTCCTG AAATACGTTG ATGGCGACTT ATGTCCAGAT GGGATTCGGA 1680 

AAAAGTCAAC CACCATCCGA TTCACCTGCA GCGAGAGCCA AGTGAACTCC AGGCCCATGT 17 4 0 

TCATCAGCGC CGTGGAGGAC TGTGAGTACA CCTTTGCCTG GCCCACAGCC ACAGCCTGTC 18 00 

CCATGAAGAG CAACGAGCAT GATGACTGCC AGGTCACCAA CCCAAGCACA GGACACCTGT 18 60 

TTGATCTGAG CTCCTTAAGT GGCAGGGCGG GATTCACAGC TGCTTACAGC GAGAAGGGGT 1920 

TGGTTTACAT GAGCATCTGT GGGGAGAATG AAAACTGCCC TCCTGGCGTG GGGGCCTGCT 1980 

TTGGACAGAC CAGGATTAGC GTGGGCAAGG CCAACAAGAG GCTGAGATAC GTGGACCAGG 204 0 

TCCTGCAGCT GGTGTACAAG GATGGGTCCC CTTGTCCCTC CAAATCCGGC CTGAGCTATA 2100 

AGAGTGTGAT CAGTTTCGTG TGCAGGCCTG AGGCCGGGCC AACCAATAGG CCCATGCTCA 2160 

TCTCCCTGGA CAAGCAGACA TGCACTCTCT TCTTCTCCTG GCACACGCCG CTGGCCTGCG 2220 

AGCAAGCGAC CGAATGTTCC GTGAGGAATG GAAGCTCTAT TGTTGACTTG TCTCCCCTTA 2280 

TTCATCGCAC TGGTGGTTAT GAGGCTTATG ATGAGAGTGA GGATGATGCC T C CG AT AC CA 234 0 

ACCCTGATTT CTACATCAAT ATTTGTCAGC CACTAAATCC CATGCACGGA GTGCCCTGTC 24 00 

CTGCCGGAGC CGCTGTGTGC AAAGTTCCTA TTGATGGTCC CCCCATAGAT ATCGGCCGGG 24 60 

TAGCAGGACC ACCAATACTC AATCCAATAG CAAATGAGAT TTACTTGAAT TTTGAAAGCA 2520 

GTACTCCTTG CCAGGAATTC AGTTGTAAAT AAAATTGAAC CTGCTCAACA GCTGAGGGAG 2580 

ACTAGAAATG ATGGGTCCAT ATCCTGGTGC ATTGTCATAC AATTCAAACA ATGGTGCAGC 2 64 0 

TACCAGCTTG TAATTTTTAG GGACTGCAAA CAAGGCTTTT TCTTGAAGCT GAACCAGAAA 27 00 

CAACTTCTTA TGTTCCTTAG GCTTTGTAAT ATGTGCAGGA AT AT AT GG AT ACTGAGGAGG 27 60 

TTCAAAATTT GGTCTCCACC AGTTACCAAT GCAATCGTCA ATGACCCAGT CTTGCAAAAC 2820 

TCCATCCTGA CGACCCAGTA TCTCTGTCAT TAAGCGTTTT AGTCCTTCAA CTTCATCTTC 2880 

TCCTGGGTTA AGTTCACCAC CAGGTAGTTT GAAGAAAGTT GTTCCCAGCT GCAGCAGTAA 2940 

CACATGGGGT AGCCGGTGCT CATGTACAAT CAGAACCCCT TCTACAGTCC TCCTCATTCC 3000 

AATTTTATCA AATTCTTCCC TCATGCGCTG AAATCTGGCT GCAACAGAGC TGTCCTTCTC 3060 

GTAGAGGGGC TCTTTTGTAC CAAAAGTATA ATTGGTAAGA GGGTACAGGT TGATGGTGCG 3120 

CTCCAGGGTG AGGGGCTTCG TCTGCTGGAT GTACTTGTTG CCGAACTGAG TGACCCCCCG 3180 

GGGCCAGCCG GTCTGCGAGC GATTGGGCGG TACCACAGAC ATGCTGGCGA GCTCCGGCGC 324 0 

TGACGGCGAG CAGAAAGTGG CAGGCAGGGT AGACTTTCCC CGTGCGGGAA GCCTCGTGCC 3300 

GAATTC 3306 
(2) INFORMATION FOR SEQ ID N0:8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4218 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

GAATTCGGCA CGAGAATGGA TCAACCTCAA CAACACGTTA AAGCTAGACG AAAGAAGTAA 60 

TACACAGTGT ATGAGTCTCA CATGAAATAC CCGGATGTAA ATCCAAAGAA ACAGGAAGCA 120 

GATTGGTGGT TGCCAGGGAC AAGGGCGGTG GGAGGAGAAA ATGGAGAGTA ACGGGACTTT 180 

ACTTTTGGAG TGATGAGAAT GTTTTGGAGC TAGATAGAAG TGGTGGTTGT ACACCATTGT 24 0 

GGATGTACTA CCACTTAATT GTTCACTTAA AAAGTTAATT TATGTGAATT GCATCTTAAT 300 

TAAAAACAAG GATAACATTC CAACTCCTGG ACATTATCCT TCCTTTCCAT TTGATGTCAG 360 

GCCCGTGTTA GAATTCTCAT CCGGTTTGGT CACTGCACTT AAGATGTGGA GAAATTAGGA 4 20 

s*\ CGCACAGTTA AGAGGAAGGA TAACACTGAT TAAGGTAGTG CTTTTCTAGG TTTCCCCTAA 4 80 

*S ACAATTTAAC AGATGGATAG TGGCACCACT T AC GAG AT GG AAAAACCAGC GGAAGGAAGA 54 0 

~t TTTGGGGGAG AAGTTAAGTT TGTCTTGGGC CTGTGTTTTG CAACCTGAGT GTAAAAGACA 600 

TATGTTAAGT CTTCAGTGGC GAAACACTAA AACTAGAAAT GGATCAGAAT TTTATCTTTG 660 

r? GATGTGACTT CTCAAGGATG GTCTTGTCAC TTCAGTGCCT GGTCAAATGA CAAGATGGGC 720 

O AATCTTTTCC TGAAGGTCCA AGCACCTGAA CGTGGCAGGG TGACCCGATT CCGATTTGCT 780 

W TAGAACAATC CTAGTTCATG CCTATTGTCC CTCATGTAAT TAATATCACT CTCAAAATGT 8 40 

111 CTCATTTTGT GCAATAAATT CTGCAACGTG ATGGCGCGAC TCTCGCGGCC CGAGCGGCCG 900 

JS GACCTTGTCT TCGAGGAAGA GGACCTCCCC TATGAGGAGG AAATCATGCG GAACCAATTC 960 

a TCTGTCAAAT GCTGGCTTCA CTACATCGAG TTCAAACAGG GCGCCCCGAA GCCCAGGCTC 1020 

L£ AATCAGCTAT ACGAGCGGGC ACTCAAGCTG CTGCCCTGCA GCTACAAACT CTGGTACCGA 1080 

L TACCTGAAGG CGCGTCGGGC ACAGGTGAAG CATCGCTGTG TGACCGACCC TGCCTATGAA 1140 

GATGTCAACA ACTGTCATGA GAGGGCCTTT GTGTTCATGC ACAAGATGCC TCGTCTGTGG 1200 

fj CTAGATTACT GCCAGTTCCT CATGGACCAG GGGCGCGTCA CACACACCCG CCGCACCTTC 12 60 

^ GACCGTGCCC TCCGGGCACT GCCCATCACG CAGCACTCTC GAATTTGGCC CCTGTATCTG 1320 

5 CGCTTCCTGC GCTCACACCC ACTGCCTGAG ACAGCTGTGC GAGGCTATCG GCGCTTCCTC 1380 

O AAGCTGAGTC CTGAGAGTGC AGAGGAGTAC ATTGAGTACC TCAAGTCAAG TGACCGGCTG 14 4 0 

GATGAGGCCG CCCAGCGCCT GGCCACCGTG GTGAACGACG AGCGTTTCGT GTCTAAGGCC 1500 

GGCAAGTCCA ACTACCAGCT GTGGCACGAG CTGTGCGACC TCATCTCCCA GAATCCGGAC 1560 

AAGGTACAGT CCCTCAATGT GGACGCCATC ATCCGCGGGG GCCTCACCCG CTTCACCGAC 1620 

CAGCTGGGCA AGCTCTGGTG TTCTCTCGCC GACTACTACA TCCGCAGCGG CCATTTCGAG 1680 

AAGGCTCGGG ACGTGTACGA GGAGGCCATC CGGACAGTGA TGACCGTGCG GG ACT TCACA 17 4 0 

CAGGTGTTTG ACAGCTACGC CCAGTTCGAG GAGAGCATGA TCGCTGCAAA GATGGAGACC 1800 

GCCTCGGAGC TGGGGCGCGA GGAGGAGGAT GATGTGGACC TGGAGCTGCG CCTGGCCCGC 18 60 

- TTCGAGCAGC TCATCAGCCG GCGGCCCCTG CTCCTCAACA GCGTCTTGCT GCGCCAAAAC 1920 

CCACACCACG TGCACGAGTG GCACAAGCGT GTCGCCCTGC ACCAGGGCCG CCCCCGGGAG 1980 

ATCATCAACA CCTACACAGA GGCTGTGCAG ACGGTGGACC CCTTCAAGGC CACAGGCAAG 204 0 

CCCCACACTC TGTGGGTGGC GTTTGCCAAG TTTTATGAGG ACAACGGACA GCTGGACGAT 2100 

GCCCGTGTCA TCCTGGAGAA GGCCACCAAG GTGAACTTCA AGCAGGTGGA TGACCTGGCA 2160 

AGCGTGTGGT GTCAGTGCGG AGAGCTGGAG CTCCGACACG AGAACTACGA TGAGGCCTTG 2220 

CGGCTGCTGC GAAAGGCCAC GGCGCTGCCT GCCCGCCGGG CCGAGTACTT TGATGGTTCA 2280 

GAGCCCGTGC AGAACCGCGT GTACAAGTCA CTGAAGGTCT GGTCCATGCT CGCCGACCTG 234 0 

GAGGAGAGCC TCGGCACCTT CCAGTCCACC AAGGCCGTGT ACGACCGCAT CCTGGACCTG 24 00 

CGTATCGCAA CACCCCAGAT CGTCATCAAC TATGCCATGT TCCTGGAGGA GCACAAGTAC 24 60 

TTCGAGGAGA GCTTCAAGGC GTACGAGCGC GGCATCTCGC TGTTCAAGTG GCCCAACGTG 2520 

TCCGACATCT GGAGCACCTA CCTGACCAAA TTCATTGCCC GCTATGGGGG CCGCAAGCTG 2580 

GAGCGGGCAC GGGACCTGTT TGAACAGGCT CTGGACGGCT GCCCCCCAAA ATATGCCAAG 2 64 0 

ACCTTGTACC TGCTGTACGC ACAGCTGGAG GAGGAGTGGG GCCTGGCCCG GCATGCCATG 2700 

GCCGTGTACG AGCGTGCCAC CAGGGCCGTG GAGCCCGCCC AGCAGTATGA CATGTTCAAC 27 60 
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ATCTACATCA AGCGGGCGGC CGAGATCTAT GGGGTCACCC ACACCCGCGG CATCTACCAG 2820 

AAGGCCATTG AGGTGCTGTC GGACGAGCAC GCGCGTGAGA TGTGCCTGCG GTTTGCAGAC 2880 

ATGGAGTGCA AGCTCGGGGA GATTGACCGC GCCCGGGCCA TCTACAGCTT CTGCTCCCAG 2940 

ATCTGTGACC CCCGGACGAC CGGCGCGTTC TGGCAGACGT GGAAGGACTT TGAGGTCCGG 3000 

CATGGCAATG AGGACACCAT CAAGGAAATG CTGCGTATCC GGCGCAGCGT GCAGGCCACG 3060 

TACAACACGC AGGTCAACTT CATGGCCTCG CAGATGCTCA AGGTCTCGGG CAGTGCCACG 3120 

GGCACCGTGT CTGACCTGGC CCCTGGGCAG AGTGGCATGG ACGACATGAA GCTGCTGGAA 3180 

CAGCGGGCAG AGCAGCTGGC GGCTGAGGCG GAGCGTGACC AGCCCTTGCG CGCCCAGAGC 3240 

AAGATCCTGT TCGTGAGGAG TGACGCCTCC CGGGAGGAGC TGGCAGAGCT GGCACAGCAG 3300 

GTCAACCCCG AGGAGATCCA GCTGGGCGAG GACGAGGACG AGGACGAGAT GGACCTGGAG 3360 

CCCAACGAGG TTCGGCTGGA GCAGCAGAGC GTGCCAGCCG CAGTGTTTGG GAGCCTGAAG 3420 

GAAGACTGAC CCGTCCCCTC GTGCCGAATT CGGCACGAGC AAGACCAGCC CCCAGATCAT 3480 

TTGCCTCAAA GGTTTTCCCT CGAAGTCACA AATGTTTCAA GGAATCTCAA ATTTTACAAA 3540 

GTTTGAAGTG TGGGCATTGG TGGCCTGTGG CTGTGTCCTC TCTCTGTAGC TGTTTTCTCC 3600 

CTACATCCCT GAAAGGAAGT TGAGCCTGCT CCTCCATCCG CAGACCTCCC TTTCCAGCGC 3660 

CCAGGGCATG GGGTGCTGTG AGGGCAGCAT GCTAGGTGTG ACCGTGCTCC TGGCCTCCAG 3720 

GCCCGTGTCC CTCTGTCCTC TAGCCCACTA AGGCCCTGGC CCATTTGTGC TAAACAGGCA 3780 

GTCGGACCTA GAAAGAGCAG ACAATCTCTC TGGGTCACCA GTCTGGCTAG GAGCTGGTCT 3840 

CCTGACTGGG ATCCAGGCCT TCTCCCCTGC CCATGTGAAT TCCCAGGGGC AGAGCCTGAA 3900 

ATGTTGAACA CAGCACTGGC CAAAGAGATG TCACCGTGGG AACCGAGGCT CTCTTCTCCT 3960 

CCTGCCTGCT TTCGTGGGTT CAGAGTAGCT GAGGCTTGTC TGAGAGGAGT TGGAGTGCTG 4 020 

GTTTTCACCC TGGTTGGTGT GCTTTGCTTT GAGGGCACTT AGAAAGCCCA GCCCAGCCCT 4 080 

TGCTCCTGCC CTGCACACAG CGGAGCGACT TTTCTAGGTA TGCTCTTGAT TTCTGCAGAA 4140 

GCAGCAGGTG GCATGGAGCC AAGAGGAAGT GTGACTGAAA CTGTCCACTC ATAGCCCGGC 4200 

TGCCGTATTG AGAGGGCT 4 2 18 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAGCTCGCGC GCCTGCAGGT CGACACTAGT GGATCCAAAG AATTCGGCAC GAGGGAAACT 60 

CAACGGTGTA CGAGTGGAGG AC AG GG AC AG AGCCCTCTGT GGTGGAACGA CCCCACCTCG 120 

AGGAGCTTCC TGAGCAGGTG GCAGAAGATG CGATTGACTG GGGCGACTTT GGGGTAGAGG 180 

CAGTGTCTGA GGGGACTGAC TCTGGCATCT CTGCCGAGGC TGCTGGAATC GACTGGGGCA 24 0 

- TCTTCCCGGA ATCAGATTCA AAGGATCCTG GAGGTGATGG GATAGACTGG GGAGACGATG 300 

CTGTTGCTTT GCAGATCACA GTGCTGGAAG CAGGAACCCA GGCTCCAGAA GGTGTTGCCA 360 

GGGGCCCAGA TGCCCTGACA CTGCTTGAAT ACACTGAGAC CCGGAATCAG TTCCTTGATG 420 

AGCTCATGGA GCTTGAGATC TTCTTAGCCC AGAGAGCAGT GGAGTTGAGT GAGGAGGCAG 480 

ATGTCCTGTC TGTGAGCCAG TTCCAGCTGG CTCCAGCCAT CCTGCAGGGC CAGACCAAAG 540 

AGAAGATGGT TACCATGGTG TCAGTGCTGG AGGATCTGAT TGGCAAGCTT ACCAGTCTTC 600 

AGCTGCAACA CCTGTTTATG ATCCTGGCCT CACCAAGGTA TGTGGACCGA GTGACTGAAT 660 

TCCTCCAGCA AAAGCTGAAG CAGTCCCAGC TGCTGGCTTT GAAGAAAGAG CTGATGGTGC 720 

AGAAGCAGCA GGAGGCACTT GAGGAGCAGG CGGCTCTGGA GCCTAAGCTG GACCTGCTAC 780 

TGGAGAAGAC CAAGGAGCTG CAGAAGCTGA TTGAAGCTGA CATCTCCAAG AGGTACAGCG 840 

GGCGCCCTGT GAACCTGATG GGAACCTCTC TGTGACACCC TCCGTGTTCT TGCCTGCCCA 900 

TCTTCTCCGC TTTTGGGATG AAGATGATAG CCAGGGCTGT TGTTTTGGGG CCCTTCAAGG 960 

CAAAAGACCA GGCTGACTGG AAGATGGAAA GCCACAGGAA GGAAGCGGCA CCTGATGGTG 1020 

ATCTTGGCAC TCTCCATGTT CTCTACAAGA AGCTGTGGTG ATTGGCCCTG TGGTCTATCA 1080 

GGCGAAAACC ACAGATTCTC CTTCTAGTTA GTATAGCGCA AAAAGCTTCT CGAGAGTACT 114 0 
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TCTAGAGCGG CCGCGGGCCC ATCGATTTTC CACCCGGGTG GGGTACC 1187 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3306 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CCCTCACTAA AGGGAACAAA AGCTGGAGCT CGCGCGCCTG CAGGTCGACA CTAGTGGATC 60 

GAAAGTTCGT TACGCCAAGC TCGAAATTAA CTCTGGGCTG ACCCATAAAC ATTTGTCTGA 120 

TCTAGGATAT AGTTGCGTTT CTTGCGGGCA GCAATCTGGA TGAGGCGGTT GAGGCACTGG 180 

Q GTGGCCTGCT GGATCAGGAC ATCCCAGCGG CCAGCATAGT TCCGCTGCCG GCGTAGGCCC 24 0 

*fi ATCACCCGCA TCTTATCCAT GATGGCATTG GTACCCAGGA TGTTGTACTT CTTGGAAGGG 300 

M TTGGAGGCTG CATGTTTGAT GGCCCATGTG GTCTTGCCAG CAGCAGGCAG GCCCACCATC 360 

O ATCAGAATCT CACATTCTGC CTTGCTCTTT GGTCCAACGG TGCCCCGGAT ACGCTCACTA 4 20 

g AGGGGAAGGT GCTGGATGAA GGTAAACCCC GGGAGGACAG AACAGTAGGG CTCTGCTCTC 4 80 

yj TGTCCGAAGT TGAACTCCAC TGCGCAATTC TTCACCAGGA CATGAGGATA GAGGGCCTGA 54 0 

f« CCCCCCAAGG CTTCCTTCTG GATTCGGAAA GCAATGCCCA TCCACTTTCC ATTCTTGGTA 600 

^ AAAGACAGTT CCACGTCATT TCCACATTCA AAATCCGCAA AGCAGCCAAT CACCGGAGAG 660 

^ . CTCTGCGGTG CTAGGAGAGC GGCTGGGCCC GCAGACT GGG GGGAAAGCTC CGCAGCCGCA 720 

2 GTGGGCCCCA GGATCAGGCC CCGCGTGGCC TGGAGAAGCC CAGTCTGGGC TGGAGCGGGA 780 

H 1 GCTGGACAGT GTGGCCTTGC GTTCGCCCCC GGGAGCGCTG CGAGTGTCGC GGCCTCGGGT 84 0 

M GGATTTGCTG AGCACCAATA CCTCACGGTT GCCAACCTGG GGTTTTAGCT CCCTTGGTTT 900 

M* TAATCCCCTA GGGGCGGGTG GGGGCACGGG AGGAAGGATG GGCCAGCTGG GTGCAATCCT 960 

yj GCTGTAAGCC AGCCATTCCT TGATTTCTTA GAATTAACTA AACGGTCGCG CCGGAGGCCG 1020 

O CGGGGGCCGG AGCGGAGCAG CCGCGGCTGA GGTTCCCGAG TCGGCCGCTC GGGGCTGCGC 1080 

TCCGCCGCCG GGACCCCGGC CTCTGGCCGC GCCGGCTCCG GCCTCCGGGG GGGCCGGGGC 114 0 

CGCCGGGACA TGGTGCCAGT CGCACCCCTT CCCCGCCGCC GCTGAGCTCG CCGGCCGCGC 1200 

CCGGGCTGGG ACGTCCGAGC GGGAAGATGT TTTCCGCCCT GAAGAAGCTG GTGGGGTCGG 12 60 

ACCAGGCCCC GGGCCGGGAC AAGAACATCC CCGCCGGGCT GCAGTCCATG AACCAGGCGT 1320 

TGCAGAGGCG CTTCGCCAAG GGGGTGCAGT ACAACATGAA GATAGTGATC CGGGGAGACA 1380 

GGAACACGGG CAAGACAGCG CTGTGGCACC GCCTGCAGGG CCGGCCGTTC GTGGAGGAGT 14 4 0 

ACATCCCCAC ACAGGAGATC CAGGTCACCA GCATCCACTG GAGCTACAAG ACCACGGATG 1500 

ACATCGTGAA GGTTGAAGTC TGGGATGTAG TAGACAAAGG AAAATGCAAA AAGCGAGGCG 1560 

ACGGCTTAAA GATGGAGAAC GACCCCCAGG AGNCGGAGTC TGAAATGGCC CTGGATGCTG 1620 

AGTTCCTGGA CGTGTACAAG AACTGCAACG GGGTGGTCAT GATGTTCGAC ATTACCAAGC 1680 

AGTGGACCTT CAATTACATT CTCCGGGAGC TTCCAAAAGT GCCCACCCAC GTGCCAGTGT 17 4 0 

GCGTGCTGGG GAACTACCGG GACATGGGCG AGCACCGAGT CATCCTGCCG GACGACGTGC 1800 

GTGACTTCAT CGACAACCTG GACAGACCTC CAGGTTCCTC CTACTTCCGC TATGCTGAGT 1860 

CTTCCATGAA GAACAGCTTC GGCCTAAAGT ACCTTCATAA GTTCTTCAAT ATCCCATTTT 1920 

TGCAGCTTCA GAGGGAGACG CTGTTGCGGC AGCTGGAGAC GAACCAGCTG GACATGGACG 1980 

CCACGCTGGA GGAGCTGTCG GTGCAGCAGG AGACGGAGGA CCAGAACTAC GGCATCTTCC 204 0 

TGGAGATGAT GGAGGCTCGC AGCCGTGGCC ATGCGTCCCC ACTGGCGGCC AACGGGCAGA 2100 

GCCCATCCCC GGGCTCCCAG TCACCAGTCC TGCCTGCACC CGCTGTGTCC ACGGGGAGCT 2160 

CCAGCCCCGG CACACCCCAG CCCGCCCCAC AGCTGCCCCT CAATGCTGCC CCACCATCCT 2220 

CTGTGCCCCC TGTACCACCC TCAGAGGCCC TGCCCCCACC TGCGTGCCCC TCAGCCCCCG 2280 

CCCCACGGCG CAGCATCATC TCTAGGCTGT TTGGGACGTC ACCTGCCACC GAGGCAGCCC 234 0 

CTCCACCTCC AGAGCCAGTC CCGGCCGCAC AGGGCCCAGC AACGGTCCAG AGTGTGGAGG 24 00 

ACTTTGTTCC TGACGACCGC CTGGACCGCA GCTTCCTGGA AGACACAACC CCCGCCAGGG 24 60 

ACGAGAAGAA GGTGGGGGCC AAGGCTGCCC AGCAGGACAG TGACAGTGAT GGGGAGGCCC 2520 

TGGGCGGCAA CCCGATGGTG GCAGGGTTCC AGGACGATGT GGACCTCGAA GACCAGCCAC 2580 
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GTGGGAGTCC CCCGCTGCCT GCAGGCCCCG TCCCCAGTCA AGACATCACT CTTTCGAGTG 264 0 

AGGAGGAAGC AGAAGTGGCA GCTCCCACAA AAGGCCCTGC CCCAGCTCCC CAGCAGTGCT 2700 

CAGAGCCAGA GACCAAGTGG TCCTCCATAC CAGCTTCGAA GCCACGGAGG GGGACAGCTC 27 60 

CCACGAGGAC CGCAGCACCC CCCTGGCCAG GCGGTGTCTC TGTTCGCACA GGTCCGGAGA 2820 

AGCGCAGCAG CACCAGGCCC CCTGCTGAGA TGGAGCCGGG GAAGGGTGAG CAGGCCTCCT 2880 

CGTCGGAGAG TGACCCCGAG GGACCCATTG CTGCACAAAT GCTGTCCTTC GTCATGGATG 294 0 

ACCCCGACTT TGAGAGCGAG GGATCAGACA CACAGCGCAG GGCGGATGAC TTTCCCGTGC 3000 

GAGATGACCC CTCCGATGTG ACTGACGAGG ATGAGGGCCC TGCCGAGCCG CCCCCACCCC 3060 

CCAAGCTCCC TCTCCCCGCC TTCAGACTGA AGAATGACTC GGACCTCTTC GGGCTGGGGC 3120 

TGGAGGAGGC CGGACCCAAG GAGAGCAGTG AGGAAGGTAA GGAGGGCAAA ACCCCCTCTA 3180 

AGGAGAAGAA AAAAAAAACA AAAAGCTTCT CGAGAGTACT TCTAGAGCGG CCGCGGGCCC 324 0 

ATCGATTTTC CACCCGGGTG GGGTACCAGG TAAGTGTACC CAATTCGCCC TATAGTGAGT 3300 

CGTATT 3306 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
P (B) TYPE: nucleic acid 

£} (C) STRANDEDNESS: single 

\j (D) TOPOLOGY: linear 

□ 

rt (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

^ TGCGGGGCCA GAGTGGGCTG 20 

2 (2) INFORMATION FOR SEQ ID NO: 12: 

M* (i) SEQUENCE CHARACTERISTICS: 
U (A) LENGTH: 20 base pairs 

f:| (B) TYPE: nucleic acid 

S (C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GCAGTCCTGG CCTGCGGATG 20 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTCGACAGGA GAATTGGTTC 20 



(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GCCTGGGTTC GGTGCGGGAC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TGGTCGGGTG TTTGTGAGTG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CCTCTTCCGT CTCCTCAGTG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GGATTGCTAG TCTCACAGAC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



- 13 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
TTAAGGGTGG CTGAAGGGAC 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
ACCTTCCCTC CCTGTCACAG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20 
TGGTCGGGTG TTTGTGAGTG 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 
ACACCATTCC AGAAATTCAG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

AAACTGCAGG TGGCTGAGTC 

(2) INFORMATION FOR SEQ ID NO: 23: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
GTCCTAATGT TTTCAGGGAG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
AAAACCTATG GTTACAATTC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
TCCTAGACAT GGTTCAAGTG 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26 
GATATAATTA GTTCTCCATC 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
ATGCCTGTTC CAGGCTGCAC 

(2) INFORMATION FOR SEQ ID NO;28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
GGACGGCGAC CTCCACCCAC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GGGCTCCTCC GACGCCTGAG 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
AGTCTAGCCC TGGCCTTGAC 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GTCACTGGGG ACTCCGGCAG 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID.NO:32 
CAGCTTTCCC TGGGCACATG 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
CACAGCTGTC TCAAGCCCAG 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
ACTGTTCCCC CTACATGATG 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :' 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
ATCATATCCT CTTGCTGGTC 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
GTTCCCAGAG CTTGTCTGTG 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
GTTTGGCAGA CTCATAGTTG 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
TAGCAGGGAG CCATGACCTG 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
CTTGGCGCCA GAAGCGAGAG 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
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CCTCTCTCTC TCTCTCTCTC 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
TCCCCGCTGA TTCCGCCAAG 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base -pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 



(xi) SEQUENCE- DESCRIPTION: SEQ ID NO: 42 
CTTTTTGAAT TCGGCACGAG 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
CCCCTGGTCC GCACCAGTTC 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 4 

GAGAAGGGTC GGGGCGGCAG 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 5 
AAATCACATC GCGTCAACAC 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 
TAAGAGAGTC ATAGTTACTC 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 
GCTCTAGAAG TACTCTCGAG 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 
ACTCTGGCCA TCAGGAGATC 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 
CAGGCGTTGT AGATGTTCTG 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 
AGTGGCAGGC AGAAGTAATG 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 
GGTTGGAGAA CTGGATGTAG 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
CTATTCAGAT GCAACGCCAG 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
CCATGGCACA C AG AG C AG AC 

(2) INFORMATION FOR SEQ ID NO: 54: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54 
GCTACCATGC AGAGACACAG 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
CAGGCTGACA AGAAAATCAG 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 
GGCACGCATA GAGGAGAGAC 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57 
TGGGTGATGC CTTTGCTGAC 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 
AAAACAAGAT CAAGGTGATG 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 
TTGCCCACAT TGCTATGGTG 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 
GACCAAGATC AGAAGTAGAG 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
CCCCTGGGCC AATGATGTTG 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 



TCTTCCCACC ATAGCAATG 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 
TGGTCTTGGT GACCAATGTG 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 
ACACCTCGGT GACCCCTGTG 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 
TCTCCAAGTT CGGCACAGTG 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ TD NO: 66 
ACATGGGCTG CACTCACGAC 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 
GATCCTCTGA ACCTGCAGAG 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 
GGAAATGAGG TGGGGCGATC 

(2) INFORMATION FOR SEQ ID NO:69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 
CTTTGCCTTG GACAAGGATG 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 
GCACCTGCCA TTGGGGGTAG 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 



GGTGGAAGCC ATTGACGGTG 

(2) INFORMATION FOR SEQ ID NO:72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 
TGCGTCTCTC GTCGCTGCTG 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 
GCGGAAACTC TGTGGTGCTG 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 
AGGATTGCCT TCCTCTACTG 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 

TGTCTGTTTC ACCAGGGCAG 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 
CCAGTGCCTC TATGCATGTC 

(2) INFORMATION FOR SEQ ID NO:77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 
AGGAAGCCCA CGCACACCAC 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 
CCCTTTGTTC CCTGATCTTC 

(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 
CGCTCGGGAT CCAGGTCATC 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 



TCGAGGTTCA GAGCGTAGTG 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 
TCTTGGATCT CTGGCACCTC 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 
CCATCAGAGT GAAGGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83 
CCATCTTCCA CTGGTCAGAG 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 
CTCCTTCTCT TGGATCTCTG 

(2) INFORMATION FOR SEQ ID NO: 85: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 
TTACTTCAGC ACTGTTAGTC 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 
AGGGAGGTAG CTCAAAGCTC 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 
TGGGTCCACA GTTCGCACAG 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 
CAACTCTGTG ATGGCTCCAG 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 
AGCAGGGTTC TGTTCAAGAC 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 
CCATTGGGTG CTAGTCTCTC 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 
CAGCCATGCT GTCCCAGCAG 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 
CTGGACCTGA GGTAGCGCTG 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 
ATAACCACCC TGAGGCACTG 

(2) INFORMATION FOR SEQ ID NO: 94: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 
CCTGCAGGTC GACACTAGTG 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 
AATTGGAATG AGGAGGACTG 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 
GCTCTAGAAG TACTCTCGAG 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ TD NO: 97 
ATTGTATGAC AATGCACCAG 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
TCCACAGAGG GCTTCATCAC 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
CCTGACTGGC CTAAGCACAG 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 
AAGCCTCATA ACCACCAGTG 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 
TGTCAACGGT GACAAGTGTG 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 



TTGTACACCA GCTGCAGGTC 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 
GGGTGTGGTG CAGATGAGTC 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 
ATCACACTCT TATAGCTCAG 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 
GTGGGAAGCT TTCCTCAGAC 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 

TGATGAACAT GGGCCTGGAG 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 
CATTGTGGAT GTACTACCAC 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 
TGTGTTTTGC AACCTGAGTG 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 
ATAGTGGCAC CACTTACGAG 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 
AATTCTGCAA CGTGATGGCG 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 
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CACAAGATGC CTCGTCTGTG 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 
AATCCGGACA AGGTACAGTC 

(2) INFORMATION FOR SEQ ID NO:113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 
GCACGAGTGG CACAAGCGTG 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 
GCAAGCGTGT GGTGTCAGTG 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 
TGTTTGAACA GGCTCTGGAC 

(2) INFORMATION FOR SEQ ID NO: 116: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 
CGGCATGGCA AT GAG G AC AC 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 
AGGACGAGAT GGACCTCCAG 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 
CCCTCTGTCC TCTAGCCCAC 

(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 
TCTTGAGGGG ACTGACTCTG 

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 
TGAGTGAGGA GGCAGATGTC 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 
TGGCTTTGAA GAAAGAGCTG 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 
GCAAAAGACC AGGCTGACTG 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 
TGCAGCTCCT TGGTCTTCTC 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 
GATTCACAGT CCCAAGGCTC 

(2) INFORMATION FOR SEQ ID NO: 125: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 
ATCTGGATGA GGCGGTTGAG 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126 
GGTCACTCTC CGACGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 
GGATCCAAAG TTCGTCTCTG 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 
CGCTGTGTGT CTGATCCCTC 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 
ATGAAGGTAA ACCCCGGGAG 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 
TGGTCTCTGG CTCTGAGCAC 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 
GCCTGGAGAA GCCCAGTCTG 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 
CACACTCTGG ACCGTTGCTG 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: ■ 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 



AAAGCTCCGC AGCCGCAGTG 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 
TCTTCCAGGA AGCTGCGGTC 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 
GATGGTGGGG CAGCATTGAG 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 
GTCACCAGTG GTGCCTGCAG 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 
ACCTCACGGT TGCCAACCTG 

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 
CGCAACAGCG TCTCCCTCTG 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 
AGTACCTTCA TAAGTTCTTC 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 
TCCCAGACTT CAACCTTCAC 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 
AAACATCTTC CCGGTCGGAC 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 



GCTGAGCACC TTTACCTCAC 



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 
GACGTCCGTC CGGGAAGATG 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 4 
ACACAGGAGA TGCAGGTCAC 

(2) INFORMATION FOR SEQ ID NO: 14 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145 
GAGTCTTCCA TGAAGAACAG 

(2) INFORMATION FOR SEQ ID NO: 14 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 
GCAGTGAGGA AGGTAAGGAG 

(2) INFORMATION FOR SEQ ID NO: 147: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 4047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 378... 1799 
(D) OTHER INFORMATION : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GGATCCAAAG GACGCCCCCG CCGACAGGAG AATTGGTTCC CGGGCCCGCG GCGATGCCCC 60 

CCCGGTAGCT CGGGCCCGTG GTCGGGTGTT TGTGAGTGTT TCTATGTGGG AGAAGGAGGA 120 

GGAGGAGGAA GAAGAAGCAA CGATTTGTCT TCTCGGCTGG TCTCCCCCCG GCTCTACATG 180 

TTCCCCGCAC TGAGGAGACG GAAGAGGAGC CGTAGCCGCC CCCCCTCCCG GCCCGGATTA 240 

TAGTCTCTCG CCACAGCGGC CTCGGCCTCC CCTTGGATTC AGACGCCGAT TCGCCCAGTG 300 

TTTGGGAAAT GGGAAGTAAT GACAGCTGGC ACCTGAACTA AGTACTTTTA TAGGCAACAC ' 360 

CATTCCAGAA ATTCAGG ATG AAT GGG GAT ATG CCC CAT GTC CCC ATT ACT 410 

Met Asn Gly Asp Met Pro His Val Pro He Thr 



SI 1 5 10 

o 

Q ACT CTT GCG GGG ATT GCT AGT CTC ACA GAC CTC CTG AAC CAG CTG CCT 4 58 

h) Thr Leu Ala Gly He Ala Ser Leu Thr Asp Leu Leu Asn Gin Leu Pro 

|J] 15 20 25 

CTT CCA TCT CCT TTA CCT GCT ACA ACT ACA AAG AGC CTT CTC TTT AAT 506 

f. Leu Pro Ser Pro Leu Pro Ala Thr Thr Thr Lys Ser Leu Leu Phe Asn 

30 35 40 

^ GCA CGA ATA GCA GAA GAG GTG AAC TGC CTT TTG GCT TGT AGG GAT GAC 554 

W Ala Arg He Ala Glu Glu Val Asn Cys Leu Leu Ala Cys Arg Asp Asp 

Q 45 50 55 

n 

= AAT TTG GTT TCA CAG CTT GTC CAT AGC CTC AAC CAG GTA TCA ACA GAT 602 

Asn Leu Val Ser Gin Leu Val His Ser Leu Asn Gin Val Ser Thr Asp 

60 65 70 75 

CAC ATA GAG TTG AAA GAT AAC CTT GGC AGT GAT GAC CCA GAA GGT GAC 650 

His He Glu Leu Lys Asp Asn Leu Gly Ser Asp Asp Pro Glu Gly Asp 
80 85 90 

ATA CCA GTC TTG TTG CAG GCC GTC CTG GCA AGG AGT CCT AAT GTT TTC 698 

He Pro Val Leu Leu Gin Ala Val Leu Ala Arg Ser Pro Asn Val Phe 

95 100 105 

AGG GAG AAA AGC ATG CAG AAC AGA TAT GTA CAA AGT GGA ATG ATG ATG 74 6 

Arg Glu Lys Ser Met Gin Asn Arg Tyr Val Gin Ser Gly Met Met Met 

110 115 120 

TCT CAG TAT AAA CTT TCT CAG AAT TCC ATG CAC AGT AGT CCT GCA TCT 794 

Ser Gin Tyr Lys Leu Ser Gin Asn Ser Met His Ser Ser Pro Ala Ser 

125 130 135 



TCC AAT TAT CAA CAA ACC ACT ATC TCA CAT AGC CCC TCC AGC CGG TTT 
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Ser Asn Tyr Gin Gin Thr Thr lie Ser His Ser Pro Ser Ser Arg Phe 
140 145 150 155 



GTG CCA CCA CAG ACA AGC TCT GGG AAC AGA TTT ATG CCA CAG CAA AAT 
Val Pro Pro Gin Thr Ser Ser Gly Asn Arg Phe Met Pro Gin Gin Asn 
160 165 170 

AGC CCA GTG CCT AGT CCA TAC GCC CCA CAA AGC CCT GCA GGA TAC ATG 
Ser Pro Val Pro Ser Pro Tyr Ala Pro Gin Ser Pro Ala Gly Tyr Met 
175 180 185 

CCA TAT TCC CAT CCT TCA AGT TAC ACA ACA CAT CCA CAG ATG CAA CAA 
Pro Tyr Ser His Pro Ser Ser Tyr Thr Thr His Pro Gin Met Gin Gin 
190 195 200 

GCA TCG GTA TCA AGT CCC ATT GTT GCA GGT GGT TTG AGA AAC ATA CAT 
Ala Ser Val Ser Ser Pro lie Val Ala Gly Gly Leu Arg Asn lie His 
205 210 215 

GAT AAT AAA GTT TCT GGT CCG TTG TCT GGC AAT TCA GCT AAT CAT CAT 
Asp Asn Lys Val Ser Gly Pro Leu Ser Gly Asn Ser Ala Asn His His 
220 225 230 235 

GCT GAT AAT CCT AGA CAT GGT TCA AGT GAG GAC TAC CTA CAC ATG GTG 
Ala Asp Asn Pro Arg His Gly Ser Ser Glu Asp Tyr Leu His Met Val 
240 245 250 

CAC AGG CTA AGT AGT GAC GAT GGA GAT TCT TCA ACA ATG AGG AAT GCT 
His Arg Leu Ser Ser Asp Asp Gly Asp Ser Ser Thr Met Arg Asn Ala 
255 260 265 

GCA TCT TTT CCC TTG AGA TCT CCA CAG CCA GTA TGC TCC CCT GCT GGA 
Ala Ser Phe Pro Leu Arg Ser Pro Gin Pro Val Cys Ser Pro Ala Gly 
270 275 280 

AGT GAA GGA ACT CCT AAA GGC TCA AGA CCA CCT TTA ATC CTA CAA TCT 
Ser Glu Gly Thr Pro Lys Gly Ser Arg Pro Pro Leu lie Leu Gin Ser 
285 290 295 

CAG TCT CTA CCT TGT TCA TCA CCT CGA GAT GTT CCA CCA GAT ATC TTG 
Gin Ser Leu Pro Cys Ser Ser Pro Arg Asp Val Pro Pro Asp lie Leu 
300 305 310 315 

CTA GAT TCT CCA GAA AGA AAA CAA AAG AAG CAG AAG AAA ATG AAA TTA 
Leu Asp Ser Pro Glu Arg Lys Gin Lys Lys Gin Lys Lys Met Lys Leu 
320 325 330 

GGC AAG GAT GAA AAA GAG CAG AGT GAG AAA GCG GCA ATG TAT GAT ATA 
Gly Lys Asp Glu Lys Glu Gin Ser Glu Lys Ala Ala Met Tyr Asp lie 
. 335 340 345 

ATT AGT TCT CCA TCC AAG GAC TCT ACT AAA CTT ACA TTA AGA CTT TCT 
lie Ser Ser Pro Ser Lys Asp Ser Thr Lys Leu Thr Leu Arg Leu Ser 
350 355 360 

CGT GTA AGG TCT TCA GAC ATG GAC CAG CAA GAG GAT ATG ATT TCT GGT 
Arg Val Arg Ser Ser Asp Met Asp Gin Gin Glu Asp Met lie Ser Gly 
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365 370 375 

GTG GAA AAT AGC AAT GTT TCA GAA AAT GAT ATT CCT TTT AAT GTG CAG 1562 
Val Glu Asn Ser Asn Val Ser Glu Asn Asp lie Pro Phe Asn Val Gin 
380 385 390 395 

TAC CCA GGA CAG ACT TCA AAA ACA CCC ATT ACT CCA CAA GAT ATA AAC 1610 
Tyr Pro Gly Gin Thr Ser Lys Thr Pro lie Thr Pro Gin Asp lie Asn 
400 405 410 

CGC CCA CTA AAT GCT GCT CAA TGT TTG TCG CAG CAA GAA CAA ACA GCA 1658 
Arg Pro Leu Asn Ala Ala Gin Cys Leu Ser Gin Gin Glu Gin Thr Ala 
415 420 425 

TTC CTT CCA GCA AAT CAA GTG CCT GTT TTA CAA CAG AAC ACT TCA GTT 1706 
Phe Leu Pro Ala Asn Gin Val Pro Val Leu Gin Gin Asn Thr Ser Val 
430 435 440 

GCT GCA AAA CAA CCC CAG ACC AAT AGT CAC AAA ACC TTG GTG CAG CCT 1754 
Ala Ala Lys Gin Pro Gin Thr Asn Ser His Lys Thr Leu Val Gin Pro 
445 450 455 

GGA ACA GGC ATA GAG GTC TCA GCA GAG CTG CCC AAG GAC AAG ACC TAAGA 1804 
Gly Thr Gly lie Glu Val Ser Ala Glu Leu Pro Lys Asp Lys Thr 
460 465 470 

TCCAGCAGGG AAC T AT G TAG TCACCCCGAG AGGCCCAGCT CTCTCCGTGA GCTCTGGGCC 18 64 

TAGGGTGGGG GTGGTTGTTG GTTCTGCGCG CACTGTTCCC CCT AC AT GAT GGGTCCATCC 1924 

CAGTTGGCTT CTCTCACTCG CTTCCTCCTG TGGAGAAGCC TGTCCAGGTG TCACTGCCTC 198 4 

CAGGAAGCTG TCTCTGATTT CTCCAGTTGA ACAGTGAGAT TTGCCACACC TCACATGCAT 204 4 

CGCTCTTGTC CCTGGAATTG TAACCATAGG TTTTCCTGTC TCCTGGAGGA CAAGGATGAG 2104 

GGCTTTCCAC TTGAGTCTCC CTGGTGGAGC CCAGCTCCTG ACATACCTGG TAAAAGTTCT 2164 

CAAGAGAAGA ACATGGAGGA GGAATGTGGA T AAC AAC CCT GGCTGCCTGT GTGTTCCAAG 2224 

CTAGGAAGAT GTAATGTCCC CACAAACGGG GTAAATGGCT TGCCTGCGTC ACAGCTGTCT 2284 

CAAGCCCAGG CCCTGGGCGC CAGCCCAAGC CCAAGGACTA GGTCCAGAGC CACACAGCGC 234 4 

CAGGCCACAT CCGCCTCACC TGGGACCCTT TGTGGGGTAC AGTCTCCGGC CCCACCCAGA 24 04 

CCTCCTGAAG GAGAGACCCC ATGGCAAGGA CTCAGCCACC TGCAGTTTCA TAAGCCCCCA 24 64 

GTGGGTTCCT AGGCATGAAG ACCACCGGTT AGAGGCTGAA CTGGCAGGAA CCTGTCTCCA 2524 

GCCCCTTCTC ACCCCAGCCG GGCCCTGCCT CAGAGGCAGC ACCCAGGACG TGGCCATGAC 2584 

CCGTGGACTC CACTCAATCC CTCTTCTCCA GGAGCCATGC AAAGTGTCAG CCAGCCAGGC 264 4 

CCCTGGAAGG CAGTCATCAC CTCTTAAGGC ATTGTGGGTG TCGGTCCTGC AACTGCCAGG 2704 

TGCAGCACAC GACCCGTGTC CGGTGTTCGA TAGCAGGGAG CCATGACCTG GCAACGATTC 27 64 

CACGCTCAAA GGGGCACCCG GGGGGCCCTG GGTCGGGGCG GATCAGCTTT CCCTGGGCAC 2824 

ATCTGCCTCA TTCCAGATCT CCAGGGCTCA TGTCTGTGAC AGGGAGGGAA GGCTCTGCCC 28 8 4 

TGGCCTTCCG TCAGCTCTGC CAGTGCAGGC TGGGCAGCCT GGGCTTTAGA GCTGGCTTCT 294 4 

GCCCACACTT TCTCCGTGAA AGGAAAACAA CTATGAGTCT GCCAAACGCA TCTCAGATGC 3004 

GTTTTAAAAA ATTCTGGTCC CCGCTCTCTG TCCCATCATC CGCCTCGGGG ACTTCCTCTC 3064 

TCCGTGGTTC TCACCCCATA CTCTGTCACT GCCACATTTT CACCTGGGCC TGGCCTTTGT 3124 

CTCCACCTGA AACTCCTGAA AATCTTGAAA TGGATTTCTA GGTCACTGGG GACTCCGGCA 3184 

GCACATTCGG CTTCAGAATA AAGGGCGCCC GCGGTCCCCC AGCACCTCCC CAAGCCACAC 324 4 

CCCTAGCTTC CCTCCCTATC CCTGCAGCCT GAGGGTCCCT TCAGCCACCC TTAAGTCCCC 3304 

ACCTGGGCTC CTGCCCCGCC CCTGGCTAGC AGCGCCTTCT CCACCGGGGC CCCCTCTGCT 3364 

CACAGAGCCC CCTCACCTCC CTGGGGATGA GGGGCCAGGC CATGACCCTG AAAGTCTAGC 34 24 

CCTGGCCTTG ACCTCCCAGG AGCGCCCTCC CCGCCCTCTC CCGGCCCCGG CCCCGTCCTC 34 84 

TGCTGCTGGC CTCTGGGTCG TGCCCCGCAG ACTGAGCTGC GCTTGGGGGT CCTGGCGGCC 354 4 

TGGGCCGTCC CGCACCGAAC CCAGGCGGTC GGAGCCCGGC GGGAAGGCGC GAGGTCCTTC 3604 

TGGGGGCTCC TCCGACGCCT GAGGGCGCTG CTTCCCCGCG GCCGCCCCGG GTTTCTGCGG 3664 
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AGCCGGGGCC TCCGCTCTCG GGTGACCCGG TGAGACCCCC GGGGAGGCCG CTGGGGAGGC 3724 

GCGGGCTCTG CTCCCGGGTC CCAAACGCAC TGGCTGCCCC TCAGGAGGGA CGGCGACCTC 37 84 

CACCCACGGC GCTGGCGCCC GCACGGCCGC TCCTCCCGCT CCCGCAGCCT GGACGCCTCC 38 4 4 

CGAGGCCGCC CCGCCGGGCC CCACGCGCGG CCCCATCCGC AGGCCAGGAC TGCCTTCCCG 3 904 

GAGCTGGCGG CCCCCAGCCT GGAGGAGCCG GCCCCAGACG CCCTCCCAGC CCTCCCCAGC 3964 

CCACTCTGGC CCCGCAGCCC CCGCCTGGTC CGAGTGCGGG TCTCTGGCCC CGGCCTTTCC 4 024 

CGGGGAAGGA AAGCAAAAAG CTT 4 047 

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



Q 




(xi) SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 


148 : 












Met 


Asn 


Gly 


Asp 


Met 


Pro 


His 


Val 


Pro 


He 


Thr 


Thr 


Leu 


Ala 


Gly 


He 


H 


1 








5 










10 










15 






Ala 


Ser 


Leu 


Thr 


Asp 


Leu 


Leu 


Asn 


Gin 


Leu 


Pro 


Leu 


Pro 


Ser 


Pro 


Leu 










20 










25 
















3 1 

UJ 


Pro 


Ala 


Thr 


Thr 


Thr 


Lys 


Ser 


Leu 


Leu 


Phe 


Asn 


Ala 


Arg 


lie 


Ala 


GlU 


3 Pi 
Ul 






35 










40 










A ET 
H D 










Glu 


Val 


Asn 


Cys 


Leu 


Leu 


Ala 


Cys 


Arg 


Asp 


Asp 


Asn 


Leu 


val 


C n -v- 

ber 


bin 






50 










55 










ten 










H= 


Leu 


Vdl 


nlS 


Ser 


Leu 


Asn 


bin 


va JL 




inr 


Asp 


LI -i f> 

HIS 


lie 


blU 


Leu 


Lys 




65 










70 










75 










80 




Asp 


Asn 


Leu 


Gly 


Ser 


Asp 


Asp 


Pro 


Glu 


Gly 


Asp 


He 


Pro 


Val 


Leu 


Leu 












85 










90 










95 




y 


" Gin 


Ala 


Val 


Leu 


Ala 


Arg 


Ser 


Pro 


Asn 


Val 


Phe 


Arg 


Glu 


Lys 


Ser 


Met 










100 










105 










110 






□ 


Gin 


Asn 


Arg 


Tyr 


Val 


Gin 


Ser 


Gly 


Met 


Met 


Met 


Ser 


Gin 


Tyr 


Lys 


Leu 








115 










120 










125 










Ser 


Gin 


Asn 


Ser 


Met 


His 


Ser 


Ser 


Pro 


Ala 


Ser 


Ser 


Asn 


Tyr 


Gin 


Gin 






130 










135 










140 












Thr 


Thr 


He 


Ser 


His 


Ser 


Pro 


Ser 


Ser 


Arg 


Phe 


Val 


Pro 


Pro 


Gin 


Thr 




145 










150 










155 










160 




Ser 


Ser 


Gly 


Asn 


Arg 


Phe 


Met 


Pro 


Gin 


Gin 


Asn 


Ser 


Pro 


Val 


Pro 


Ser 












165 










170 










175 






Pro 


Tyr 


Ala 


Pro 


Gin 


Ser 


Pro 


Ala 


Gly 


Tyr 


Met 


Pro 


Tyr 


Ser 


His 


Pro 










180 










185 










190 








Ser 


Ser 


Tyr 


Thr 


Thr 


His 


Pro 


Gin 


Met 


Gin 


Gin 


Ala 


Ser 


Val 


Ser 


Ser 








195 










200 










205 










Pro 


He 


Val 


Ala 


Gly 


Gly 


Leu 


Arg 


Asn 


He 


His 


Asp 


Asn 


Lys 


Val 


Ser 






210 










215 










220 












Gly 


Pro 


Leu 


Ser 


Gly 


Asn 


Ser 


Ala 


Asn 


His 


His 


Ala 


Asp 


Asn 


Pro 


Arg 




225 










230 










235 










240 




His 


Gly 


Ser 


Ser 


Glu 


Asp 


Tyr 


Leu 


His 


Met 


Val 


His 


Arg 


Leu 


Ser 


Ser 












245 










250 










255 






Asp 


Asp 


Gly 


Asp 


Ser 


Ser 


Thr 


Met 


Arg 


Asn 


Ala 


Ala 


Ser 


Phe 


Pro 


Leu 










260 










265 










270 








Arg 


Ser 


Pro 

one. 


Gin 


Pro 


Val 


Cys 


Ser 
oq n 


Pro 


Ala 


Gly 


Ser 


Glu 


Gly 


Thr 


Pro 



275 280 285 
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Lys Gly Ser Arg Pro Pro Leu He Leu Gin Ser Gin Ser Leu Pro Cys 

290 295 300 

Ser Ser Pro Arg Asp Val Pro Pro Asp He Leu Leu Asp Ser Pro Glu 
305 310 315 320 

Arg Lys Gin Lys Lys Gin Lys Lys Met Lys Leu Gly Lys Asp Glu Lys 

325 330 335 

Glu Gin Ser Glu Lys Ala Ala Met Tyr Asp He He Ser Ser Pro Ser 

340 345 350 

Lys Asp Ser Thr Lys Leu Thr Leu Arg Leu Ser Arg Val Arg Ser Ser 

355 360 365 

Asp Met Asp Gin Gin Glu Asp Met He Ser Gly Val Glu Asn Ser Asn 

370 375 380 

Val Ser Glu Asn Asp He Pro Phe Asn Val Gin Tyr Pro Gly Gin Thr 
385 390 395 400 

Ser Lys Thr Pro He Thr Pro Gin Asp He Asn Arg Pro Leu Asn Ala 

405 410 415 

Ala Gin Cys Leu Ser Gin Gin Glu Gin Thr Ala Phe Leu Pro Ala Asn 

420 425 430 

Gin Val Pro Val Leu Gin Gin Asn Thr Ser Val Ala Ala Lys Gin Pro 
□ 435 440 445 

yg Gin Thr Asn Ser His Lys Thr Leu Val Gin Pro Gly Thr Gly He Glu 
%j 450 455 460 

g Val Ser Ala Glu Leu Pro Lys Asp Lys Thr 
m 465 470 



(2) INFORMATION FOR SEQ ID NO: 149: 



hi 

m 

' (i) SEQUENCE CHARACTERISTICS: 

3 (A) LENGTH: 2998 base pairs 

(B) TYPE: nucleic acid 
M> (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

w 

(ii) MOLECULE TYPE: Genomic DNA 
~ (ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 26... 799 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

- AAGCTTTTTG AATTCGGCAC GAGAT GCT ACA CAG GCT ATA TTT GAA ATA CTG 52 

Ala Thr Gin Ala He Phe Glu He Leu 
1 5 

GAG AAA TCC TGG TTG CCC CAG AAT TGT ACA CTG GTT GAT ATG AAG ATT 100 
Glu Lys Ser Trp Leu Pro Gin Asn Cys Thr Leu Val Asp Met Lys He 
10 15 20 25 

GAA TTT GGT GTT GAT GTA ACC ACC AAA GAA ATT GTT CTT GCT GAT GTT 148 
Glu Phe Gly Val Asp Val Thr Thr Lys Glu He Val Leu Ala Asp Val 
30 35 40 

ATT GAC AAT GAT TCC TGG AGA CTC TGG CCA TCA GGA GAT CGA AGC CAA 196 
He Asp Asn Asp Ser Trp Arg Leu Trp Pro Ser Gly Asp Arg Ser Gin 
45 50 55 
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CAG 
Gin 


AAA 
Lys 


GAC 
Asp 
60 


AAA 
Lys 


CAG 
Gin 


TCT 
Ser 


TAT 
Tyr 


CGG 
Arg 
65 


GAC 
Asp 


CTC 
Leu 


AAA 
Lys 


GAA 
Glu 


GTA 
Val 
70 


ACT 
Thr 


CCT 
Pro 


GAA 
Glu 


244 




GGG 
Gly 


CTC 
Leu 
75 


CAA 
Gin 


ATG 
Met 


GTA 
Val 


AAG 
Lys 


AAA 
Lys 
80 


AAC 
Asn 


TTT 
Phe 


GAG 
Glu 


TGG 
Trp 


GTT 
Val 
85 


GCA 
Ala 


GAG 
Glu 


AGA 
Arg 


GTA 
Val 


292 




GAG 
Glu 
90 


TTG 
Leu 


CTT 
Leu 


TTG 
Leu 


AAA 
Lys 


TCA 
Ser 
95 


GAA 
Glu 


AGT 
Ser 


CAG 
Gin 


TGC 
Cys 


AGG 
Arg 
100 


GTT 
Val 


GTA 
Val 


GTG 
Val 


TTG 
Leu 


ATG 
Met 
105 


340 




GGC 
Gly 


TCT 
Ser 


ACT 
Thr 


TCT 
Ser 


GAT 
Asp 
110 


CTT 
Leu 


GGT 
Gly 


CAC 
His 


TGT 
Cys 


GAA 
Glu 
115 


AAA 
Lys 


ATC 
He 


AAG 
Lys 


AAG 
Lys 


GCC 
Ala 
120 


TGT 
Cys 


388 




GGA 
Gly 


AAT 
Asn 


TTT 
Phe 


GGC 
Gly 
125 


ATT 
He 


CCA 
Pro 


TGT 
Cys 


GAA 
Glu 


CTT 
Leu 
130 


CGA 
Arg 


GTA 
Val 


ACA 
Thr 


TCT 
Ser 


GCG 
Ala 
135 


CAT 
His 


AAA 
Lys 


436 


%\ : 

H 

S ; 

5 


GGA 
Gly 


CCA 
Pro 


GAT 
Asp 
140 


GAA 
Glu 


ACT 
Thr 


CTG 
Leu 


AGG 
Arg 


ATT 
He 
145 


AAA 
Lys 


GCT 
Ala 


GAG 
Glu 


TAT 
Tyr 


GAA 
Glu 
150 


GGG 
Gly 


GAT 
Asp 


GGC 
Gly 


484 


w 

: F: 


ATT 
He 


CCT 
Pro 
155 


ACT 
Thr 


GTA 
Val 


TTT 
Phe 


GTG 
Val 


GCA 
Ala 
160 


GTG 
Val 


GCA 
Ala 


GGC 
Gly 


AGA 
Arg 


AGT 
Ser 
165 


AAT 
Asn 


GGT 
Gly 


TTG 
Leu 


GGA 
Gly 


532 


3 

M. 


CCA 
Pro 
170 


GTG 
Val 


ATG 
Met 


TCT 
Ser 


GGG 
Gly 


AAC 
Asn 
175 


ACT 
Thr 


GCA 
Ala 


TAT 
Tyr 


CCA 
Pro 


GTT 
Val 
180 


ATC 
He 


AGC 
Ser 


TGT 
Cys 


CCT 
Pro 


CCC 
Pro 
185 


580 


s - 3 

%M 
U 
O 


CTC 
Leu 


ACA 
Thr 


CCA 
Pro 


GAC 
Asp 


TGG 
Trp 
190 


GGA 
Gly 


GTT 
Val 


CAG 
Gin 


GAT 
Asp 


GTG 
Val 
195 


TGG 
Trp 


TCT 
Ser 


TCT 
Ser 


CTT 
Leu 


CGA 
Arg 
200 


CTA 
Leu 


628 




CCC 
Pro 


AGT 
Ser 


GGT 
Gly 


CTT 
Leu 
205 


GGC 
Gly 


TGT 
Cys 


TCA 
Ser 


ACC 
Thr 


GTA 
Val 
210 


CTT 
Leu 


TCT 
Ser 


CCA 
Pro 


GAA 
Glu 


GGA 
Gly 
215 


TCA 
Ser 


GCT 
Ala 


67 6 




CAA 
- Gin 


TTT 
Phe 


GCT 
Ala 
220 


GCT 
Ala 


CAG 
Gin 


ATA 
He 


TTT 
Phe 


GGG 
Gly 
225 


TTA 
Leu 


AGC 
Ser 


AAC 
Asn 


CAT 
His 


TTG 
Leu 
230 


GTA 
Val 


TGG 
Trp 


AGC 
Ser 


724 




AAA 
Lys 


CTG 
Leu 
235 


CGA 
Arg 


GCA 
Ala 


AGC 
Ser 


ATT 
He 


TTG 
Leu 
240 


AAC 
Asn 


ACA 
Thr 


TGG 
Trp 


ATT 
He 


TCC 
Ser 
245 


TTG 
Leu 


AAG 
Lys 


CAG 
Gin 


GCT 
Ala 


772 




GAC 
Asp 


AAG 
Lys 


AAA 
Lys 


ATC 
He 


AGA 
Arg 


GAA 
Glu 


TGT 
Cys 


AAT 
Asn 


TTA 
Leu 


TAAGAAAGAA TGCCATTGAA TTTTTTA 


826 



250 255 

GGGGAAAAAC TACAAATTTC TAATTTAGCT GAAGGAAAAT CAAGCAAGAT GAAAAGGTAA 886 

TTTTAAATTA GAGAACACAA ATAAAATGTA TTAGTGAATA AATGGTGAGG GTAGGCCTAT 94 6 

TCAGATGCAA GGCCAGCAAT GGGGCTCCCC ATTATCCCCA CCCCTTTGGT CCCAGTCCCC 1006 

TTCTCTGCAA TGGGCACGCA TAGAGGAGAG ACAAAGGGTA TTAGACGCAA CATCATTGGC 1066 
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CCAGGGGAGT CCGAGAAGAG CTGCCATTGG CTGACAGGGC ATTTTCAGGC TCTGTCATTG 1126 

GTCAGGGAGC ACACCCCAGC CTGAAGAGTG ATGCCATTGG CCAGGGAGTG GTTTTGTCAT 118 6 

AGCCGTTGGC TGTGAAGTGG AAGGAAAAGA TCTGGGAATG AAGCCCTGTG GCCAGGAAGA 124 6 

TAGACAGGGC AGCAACTTCT GGGCCTCCAG GCCCTCTTCC CACCATAGCA ATGTGGGCAA 1306 

AACTGGTGTC AGGCCCCAGC CAGAAAAAGG AGCCCAAGCC AGAGGGCAAG TGACAAAGGA 1366 

TGTACCATGT CCAATCTCCC ACACCCTGGG GCTGCCCTTC CCAATGTCTT TCTTGATAGC 14 26 

CAAGTTGGGC TGGGAGCAGC TCACTGCTCC TCTAGCCAGG AGGGTTTCTC AGCTCCTGGA 14 8 6 

GGCCGCAGCT TGATGTTGAA CTGCTGCAGG GTCTGCTCCA GCTGTTTCTG GTTCCCAGCA 154 6 

AAGTAGGCGG ACACAGCATT GTGGAAGAGC AGCAGCTGCT TGTGCATCAC CTTGATCTTG 1606 

TTTTCTTCCA GGAACTTGAG CTTGATGGCC ACATCTCCCC GCAGCTTCTC ATACTTGTCC 1666 

CGATGGGCCT GGAAAGTGGC CTGGGCACTC TCAAGTCGAC CACGTGTCCC TGCATCCCGG 172 6 

GGGCCTAGAC TCAGCTCCTC TAAGTCTGTT CGGTAGGCAT CATATTCCAG CCTGGCAGCC 178 6 

TCATACTGTT TCACAGTCAT GAGCGTGTCT TCCATGGTCT TGGTGACCAA TGTGTTGATG 18 4 6 

CTAGAGACAA AGAAGTTCAC GGCTCCTAGC AGCGTTTCCC CATTCTTGCA TAGTAGTTTC 1906 

TGTGTCTCTG CATTGTAGCC AAATTCCTCC TGAAGCTCTG GGGACTTCTG GCTGAGGTCA 1966 

GCAAAGGCAT CACCCAGTGC ATGCTGGGTC TGCAGCAGGC TGTAGAGGTG GGCTGTCAGT 2026 

GCCCGGCCCA GCTGCAGGAC ACTCTCATAC TTGCGCTTCG TCTCACGCAG CAACTCAATC 208 6 

TGCAGCTCTA GCTCCAGGAT TCCGGCGCCT CCACTCCGTC CCCCGCGGGT CTGCTCTGTG 214 6 

TGCCATGGAC GGCATTGTCC CAGATATAGC CGTTGGTACA AAGCGGGGAT CTGACGAGCT 2206 

Q TTTCTCTACT TGTGTCACTA ACGGACCGTT TATCATGAGC AGCAACTCGG CTTCTGCAGC 22 66 

<J3 AAACGGAAAT GACAGCAAGA AGTTCAAAGG TGACAGCCGA AGTGCAGGCG TCCCCTCTAG 232 6 

S.j AGTGATCCAC ATCCGGAAGC TCCCCATCGA CGTCACGGAG GGGGAAGTCA TCTCCCTGGG 238 6 

GCTGCCCTTT GGGAAGGTCA CCAACCTCCT GATGCTGAAG GGGAAAAACC AGGCCTTCAT 24 4 6 

n CGAGATGAAC ACGGAGGAGG CTGCCAATAC CATGGTGAAC TACTACACCT CGGTGACCCC 2506 

TGTGCTGCGC GGCCAGCCCA TCTACATCCA GTTCTCCAAC CACAAGGAGC TGAAGACCGA 25 66 

CAGCTCTCCC AACCAGGCGC GGGCCCAGGC GGCCCTGCAG GCGGTGAACT CGGTCCAGTC 2626 

~] GGGGAACCTG GCCTTGGCTG CCTCGGCGGC GGCCGTGGAT GCAGGGATGG CGATGGCCGG 268 6 

=P GCAGAGCCCC GTGCTCAGGA TCATCGTGGA GAACCTCTTC TACCCTGTGA CCCTGGATGT 27 4 6 

s GCTGCACCAG ATTTTCTCCA AGTTCGGCAC AGTGTTGAAG ATCATCACCT TCACCAAGAA 2806 

CAACCAGTTC CAGGCCCTGC TGCAGTATGC GGACCCCGTG AGCGCCCAGC ACGCCAAGCT 28 66 

c y, GTCGCTGGAC GGGCAGAACA TCTACAACGC CTGCTGCACG CTGCGCATCG ACTTTTCCAA 2 92 6 

L& GCTCACCAGC CTCAACGTCA AGTACAACAA TGACAAGAGC CGTGACTACC TCGTGCCGAA 2 98 6 

TTCTTTGGAT CC 2 998 



a 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Ala Thr Gin Ala He Phe Glu He Leu Glu Lys Ser Trp Leu Pro Gin 

15 10 15 

Asn Cys Thr Leu Val Asp Met Lys He Glu Phe Gly Val Asp Val Thr 

20 25 30 

Thr Lys Glu He Val Leu Ala Asp Val He Asp. Asn Asp Ser Trp Arg 

35 40 45 

Leu Trp Pro Ser Gly Asp Arg Ser Gin Gin Lys Asp Lys Gin Ser Tyr 

50 55 60 

Arg Asp Leu Lys Glu Val Thr Pro Glu Gly Leu Gin Met Val Lys Lys 
65 70 75 80 
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Asn 


Phe 


Glu 


Tm 


Val 


Ala 


Glu 


Arg 










85 








Ser 


Gin 


Cys 


Arg 


Val 


Val 


Val 


Leu 








100 










His 


Cys 


Glu 


Lys 


He 


Lys 


Lys 


Ala 






115 










120 


Glu 


Leu 


Arg 


Val 


Thr 


Ser 


Ala 


His 




130 










135 




lie 


Lys 


Ala 


Glu 


Tvr 


Glu 


Glv 


Asp 


145 










150 






Val 


Ala 


Gly 


Arg 


Ser 


Asn 


Gly 


Leu 










165 








Ala 


Tvr 


Pro 


Val 


He 


Ser 


Cys 


Pro 








-LOW 










Gin 


Asp 


Val 




Ser 


Ser 


Leu 


Arg 






195 










200 


Thr 


Val 


Leu 


Ser 


Pro 


Glu 


Gly 


Ser 




210 










215 




Gly 


Leu 


Ser 


Asn 


His 


Leu 


Val 


Trp 


225 










230 






Asn 


Thr 


Trp 


He 


Ser 


Leu 


Lys 


Gin 



245 

Asn Leu 



Val 


Glu 


Leu 


Leu 


Leu 


Lys 


Ser 


Glu 




90 










95 




Met 


Glv 


Ser 


Thr 


Ser Asp 


Leu 


Glv 


105 










110 






Cys 


Glv 


Asn 


Phe 


Glv 

JT 


He 


Pro 


Cys 










125 








Lvs 


Gly 


Pro 


Aso 


Glu 


Thr 


Leu 


Arg 








140 










Glv 


He 


Pro 


Thr 


Val 


Phe 


Val 


Ala 






155 










160 


Gly 


Pro 


Val 


Met 


Ser 


Gly Asn 


Thr 




170 










175 




Pro 


Leu 


Thr 


Pro 


Asp 


Trp 


Gly Val 


185 










190 






Leu 


Pro 


Ser 


Glv 


Leu Gly Cys 


Ser 










205 








Ala 


Gin 


Phe 


Ala 


Ala 


Gin 


He 


Phe 








220 










Ser 


Lys 


Leu 


Arg 


Ala 


Ser 


He 


Leu 






235 










240 


Ala 


Asp 


Lys 


Lys 


He 


Arg 


Glu 


Cys 




250 










255 





(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



He 


Gin 


Arg 


Phe 


Gly 


Thr 


Ser 


Gly His 


He 


Met 


Asn 


Leu 


Gin 


Ala 


Gin 


1 








5 








10 










15 




Pro 


Lys 


Ala 


Gin 


Asn 


Lys 


Arg 


Lys Arg 


Cys 


Leu 


Phe 


Gly Gly Gin 


Glu 








20 








25 










30 






Pro 


Ala 


Pro 


Lys 


Glu 


Gin 


Pro 


Pro Pro 


Leu 


Gin 


Pro 


Pro 


Gin 


Gin 


Ser 






35 










40 








45 








He 


Arg 


Val 


Lys 


Glu 


Glu 


Gin 


Tyr Leu 


Gly 


His 


Glu 


Gly 


Pro 


Gly Gly 




50 










55 








60 










Ala 


Val 


Ser 


Thr 


Ser 


Gin 


Pro 


Val .Glu 


Leu 


Pro 


Pro 


Pro 


Ser 


Ser 


Leu 


65 










70 








75 










80 


Ala 


Leu 


Leu 


Asn 


Ser 


Val 


Val 


Tyr Gly 


Pro 


Glu 


Arg 


Thr 


Ser 


Ala 


Ala 










85 








90 










95 




Met 


Leu 


Ser 


Gin 


Gin 


Val 


Ala 


Ser Val 


Lys 


Trp 


Pro 


Asn 


Ser 


Val 


Met 








100 








105 










110 






Ala 


Pro 


Gly 


Arg 


Gly 


Pro 


Glu 


Arg Gly 


Gly 


Gly 


Gly 


Gly Val 


Ser 


Asp 






115 










120 








125 








Ser 


Ser 


Trp 


Gin 


Gin 


Gin 


Pro 


Gly Gin 


Pro 


Pro 


Pro 


His 


Ser 


Thr 


Trp 




130 










135 








140 










Asn 


Cys 


His 


Ser 


Leu 


Ser 


Leu 


Tyr Ser 


Ala 


Thr 


Lys 


Gly Ser 


Pro 


His 


145 








150 








155 










160 


Pro 


Gly 


Val 


Gly 


Val 


Pro 


Thr 


Tyr Tyr 


Asn 


His 


Pro 


Glu 


Ala 


Leu 


Lys 
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D 

J™3 











165 










170 










175 




Ara 


Glu 


Lvs 


Ala 


Gly 


Gly 


Pro 


Gin 


Leu 


Asp 


Arq 


Tyr 


Val 


Arq 


Pro 


Met 








180 










185 










190 






Met 


Pro 


Gin 


Lys 


Val 


Gin 


Leu 


Glu 


Val 


Gly 


Ara 


Pro 


Gin 


Ala 


Pro 


Leu 






195 










200 










205 








Asn 


Ser 


Phe 


His 


Ala 


Ala 


Lys 


Lys 


Pro 


Pro 


Asn 


Gin 


Ser 


Leu 


Pro 


Leu 




210 










215 










220 










Gin 


Pro 


Phe 


Gin 


Leu 


Ala 


Phe 


Glv 


His 


Gin 


Val 


Asn 


Ara 


Gin 


Val 


Phe 


225 










230 










235 










240 


Arg 


Gin 


Glv 


Pro 


Pro 


Pro 


Pro 


Asn 


Pro 


Val 


Ala 


Ala 


Phe 


Pro 


Pro 


Gin 










245 










250 










255 




Lys 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Pro 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 








260 










265 










270 






Ala 


Ala 


Leu 


Pro 


Gin 


Met 


Pro 


Leu 


Phe 


Glu 


Asn 


Phe 


Tvr 


Ser 


Met 


Pro 






275 










280 










285 








Gin 


Gin 


Prn 

t j. w 


Ser 


Gin 


Gin 


Pro 


Gin 


Asp 


Phe 


Gly 


Leu 


Gin 


Pro 


Ala 


Gly 




290 










295 










300 










Pro 


Leu 


Glv 


Gin 


Ser 


His 


Leu 


Ala 


His 


His 


Ser 


Met 


Ala 


Pro 


Tyr 


Pro 


305 










310 










315 










320 


Phe 


Pro 


Pro 


Asn 


Pro 


Asp 


Met 


Asn 


Pro 


Glu 


Leu 


Arg 


Lys 


Ala 


Leu 


Leu 










325 










330 










335 




Gin 


Asp 


Ser 


Ala 


Pro 


Gin 


Pro 


Ala 


Leu 


Pro 


Gin 


Val 


Gin 


He 


Pro 


Phe 








340 










345 










350 






Pro 


Arg 


Arg 


Ser 


Arg 


Arg 


Leu 


Ser 


Lys 


Glu 


Gly 


He 


Leu 


Pro 


Pro 


Ser 






j »j »j 










360 










365 








Ala 


Leu 


Asp 


Gly 


Ala 


Gly 


Thr 


Gin 


Pro 


Glv 
j 


Gin 


Glu 


Ala 


Thr 


Gly Asn 














375 










380 










Leu 


Phe 


Leu 


His 


His 


Tro 


Pro 


Leu 


Gin 


Gin 


Pro 


Pro 


Pro 


Glv 


Ser 


Leu 


385 










390 










395 










400 


Gly 


Gin 


Pro 


His 


Pro 


Glu 


Ala 


Leu 


Gly 


Phe 


Pro 


Leu 


Glu 


Leu 


Arg 


Glu 










4 05 










4 10 










415 




Ser 


Gin 


Leu 


Leu 


Pro 


Asp 


Glv 


Glu 


Arg 


Leu 


Ala 


Pro 


Asn 


Glv 

vj j. y 


Arg 


Glu 








420 










425 










430 








m n 

uxu 


Ala 


Pro 


Ala 


Met 


Gly 


Ser 


Glu 


Glu 


Gly 


Met 


Arg 


Ala 


Val 


Ser 






T J J 










4 40 










4 4 5 








Thr 


Gly 


Asp 


Cys 


Gly 


Gin 


Val 


Leu 


Arg 


Gly 


Glv 


Val 


He 


Gin 


Ser 


Thr 




4 50 










455 










460 










Arg 


Arg 


Arg 


Arg 


Arg 


Ala 


Ser 


Gin 


Glu 


Ala 


Asn 


Leu 


Leu 


Thr 


Leu 


Ala 


4 65 










470 










475 










480 


Gin 


Lys 


Ala 


Val 


Glu 


Leu 


Ala 


Ser 


Leu 


Gin 


Asn 


Ala 


Lys 


Asp 


Gly 


Ser 










485 










490 










495 




Glv 


Ser 


Glu 


Glu 


Lys 


Arg 


Lys 


Ser 


Val 


Leu 


Ala 


Ser 


Thr 


Thr 


Lys 


Cys 








500 










505 










510 






Glv 


Val 


Glu 


Phe 


Ser 


Glu 


Pro 


Ser 


Leu 


Ala 


Thr 


Lvs 


Ara 


Ala 


Arg 


Glu 






515 










520 










525 








Asp 


Ser 


Glv 


Met 


Val 


Pro 


Leu 


He 


He 


Pro 


Val 


Ser 


Val 


Pro 


Val 


Arg 




530 










535 










540 










Thr 


Val 


Asp 


Pro 


Thr 


Glu 


Ala 


Ala 


Gin 


Ala 


Glv 


Gly 


Leu 


Asp 


Glu 


Asp 


545 










550 










555 










560 


Glv 


Lys 


Glv 


Leu 


Glu 


Gin 


Asn 


Pro 


Ala 


Glu 


His 


Lys 


Pro 


Ser 


Val 


He 










565 










570 










575 




Val 


Thr 


Arg 


Arg 


Arg 


Ser 


Thr 


Arg 


He 


Pro 


Gly 


Thr 


Asp 


Ala 


Gin 


Ala 








580 










585 










590 






Gin 


Ala 


Glu 


Asp 


Met 


Asn 


Val 


Lys 


Leu 


Glu 


Gly 


Glu 


Pro 


Ser 


Val 


Arg 






595 










600 










605 








Lys 


Pro 


Lys 


Gin 


Arg 


Pro 


Arg 


Pro 


Glu 


Pro 


Leu 


He 


He 


Pro 


Thr 


Lys 




610 










615 










620 
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Ala 


Gly 


Thr 


Phe 


He 


Ala 


Pro 


Pro 


Val 


Tyr 


Ser 


Asn 


He 


Thr 


Pro 


Tyr 




625 










630 










635 










640 




Gin 


Ser 


His 


Leu 


Arg 
64 5 


Ser 


Pro 


Val 


Arg 


Leu 
650 


Ala 


Asp 


His 


Pro 


Ser 
655 


Glu 




rti. y 




Php 


OX U 


T oil 

J_i G Lt 




Pro 

t X \J 


fur 
xyr 


Thr 
iiix 

UU J 


Prn 


Pro 


Pro 


Tip 
ixe 


Leu 
670 


Ser 


Pro 




Val 


Arg 


Glu 
67 S 


Gly 


Ser 


Gly 


Leu 


Tyr 
680 


Phe 


Asn 


Ala 


He 


lie 
685 


Ser 


Thr 


Ser 




Thr 


He 


Pro 


Ala 


Pro 


Pro 


Pro 

\j y -J 


He 


Thr 


Pro 


Lys 


Ser 
700 


Ala 


His 


Arg 


Thr 




Leu 


Leu 


Arg 


Thr 


Asn 


Ser 


Ala 


Glu 


Val 


Thr 


Pro 


Pro 


Val 


Leu 


Ser 


Val 




7ns 










7 1 n 

/ X \J 










71 S 

/ 1 »J 














Mpr 
l v Je L 


pi v 


0 1 U 


rtX ct 


Thr 
7 9 S 


Pro 

IT 1U 


val 


q pr 


T 1 p 
lie 


p i n 

Ol Li 

1~KC\ 
i jy 


Pro 


Arg 


T1p 
lie 


Asn 


Val 

7 


Gly 






Arg 


cne 


pi n 

olll 

74 n 

/ 4 U 


a i a 

raid 


pi n 


Tip 

lie 


Drn 
CL V 


Leu 

7d S 


11 C L 


A rrr 
r\.L y 


Ron 


A rrr 
rtx y 


Al a 
nld 

7 sn 


Leu 


Ala 
ril d 




Al a 
nld 


Ala 
nld 


ribp 

7 S S 

/ D D 


XT 1 \J 


nib 


Lys 


Al a 
nl d 




Leu 


Ua 1 
vdl 


T rn 

irp 


Ol 1 1 


Pro 
7 fiS 


T ro 


O 1 LL 


Zlcn 
nop 




Leu 


Pin 


oci 




Arg 


n n 
ox U 


Lys 


pi n 
bin 


A vrv 
ir\L y 


Ol 11 


\/a 1 
vdl 


Ol LI 


7\ cn 


Leu 


Leu 


Thr 
lin 


o 




■7 1 n 










77 S 
/ / D 










7 ft n 












/■Via 


Ala 








Tip 
lie 




1: 1 LJ 


Pl \/ 

uiy 


Al a 
nl d 


oiy 


Th r 
iiix 


flcn 


Gin 


Pil n 

O 1 Ll 


T.oi i 
lie Li 




7 fl S 
/DO 










7 Qn 










7 Q 1 ^ 










Ann 


a t 




Leu 






Leu 


nx b 


OIL! 


9pr 
oe i 


A rrr 
ni y 


pi v 


no £J 


T 1 p 
lie 


Leu 


Glu 


Thr 


Leu 




















qi n 

0 1 u 










ft 1 ^ 

O 1 J 




ijy 


Asn 


Lys 


Leu 


Leu 


Leu 


Lys 


Lys 


trr o 


Leu 


Arg 


D >- /^v 
riO 


H -J c 
nib 


A c n 


nib 




Leu 


m 








q 9 n 
o ^ u 










o ^ o 










ft *30 








Ala 

raid 


Th r 

i nr 


xyr 


Hie 
nib 


i yr 


Thr 

1 1 IX 


pi \/ 

oxy 


Qpr 




pi n 

Ol 1 1 


Trn 

irp 




Mpf 
l le l. 


Al a. 
rti d 


Pin 

O 1 Ll 


A rrr 
ni. y 








ODD 




















ft A ^ 
O 4 D 








g 


Lys 


Leu 


irne 


Asn 


Lys 


Pl w 
oiy 


T 1 p 
lie 


Al a 
ril d 


T 1 P 
lie 


T\/r 

iyr 


Lys 


Lys 




Php 
tile- 


Php 

it lie 


T .pi i 
lie Li 


£ s 




QCfi 

ojU 










ODD 










ft £H 












Val 


Gin 


Lys 


Leu 


He 


Gin 


Thr 


Lys 


Thr 


Val 


Ala 


Gin 


Cys 


Val 


Glu 


Phe 




O DO 










q *7 n 
o / u 










pTC 
O / D 










Aft n 
o o u 


w 


Tyr 


Tyr 


Thr 


Tyr 


Lys 


Lys 


Gin 


Val 


Lys 


He 


Gly 


Arg 


Asn 


Gly 


Thr 


Leu 












QQC 

OOJ 










ft QD 

o y u 










ft 

O -7D 






i nr 


rne 


oxy 


Asp 


Va 1 


Asp 


Th r~ 

i nr 


C o -r- 


Asp 


p i n 

ol U 


Lys 


Qor 


Ala 


ol 11 


PI ll 
Ol u. 


Pl n 

OX Ll 










yuu 










on s 
y u d 










qi n 
y i u 








Val 


Glu 


Val 

Q1 C 

y ±o 


Asp 


He 


Lys 


Thr 


Ser 
y z u 


Gin 


Lys 


Phe 


Pro 


Arg 

Q9S 


Val 


Pro 


Leu 




Pro 


Arg 
you 


Arg 


Glu 


Ser 


Pro 


Ser 
y do 


Glu 


Glu 


Arg 


Leu 


Glu 

_/ 4 u 


Pro 


Lys 


Arg 


Glu 




vdl 


Lys 


Pin 
olU 


trro 


7\ r~ iT 
r\L C_j 


Lys 


pi n 

ol U 


Pl w 

oiy 


ol U 


0 1 Lt 


fil n 

OIL! 


Va 1 
vdl 


Pro 


m n 

0 1 LI 


Tip 
J. x c 


Pl n 

OX 1 1 














you 










q c; c; 










JUL/ 




- Pl n 

Ol LI 


Lys 


Pin 


pi n 


Pl n 

J 


Glu 


Glu 


Gly 


Arg 


Glu 


Arg 


Ser 


Arg 


Arg 


Ala 
975 


Ala 




Ala 


Val 


Lys 


Ala 
980 


Thr 


Gin 


Thr 


Leu 


Gin 
985 


Ala 


Asn 


Glu 


Ser 


Ala 
990 


Ser 


Asp 




He 


Leu 


He 


Leu 


Arg 


Ser 


His 


Glu 


Ser 


Asn 


Ala 


Pro 


Gly 


Ser 


Ala 


Gly 








995 










1000 








1005 








Gly 


Gin 


Ala 


Ser 


Glu 


Lys 


Pro 


Arg 


Glu 


Gly 


Thr 


Gly 


Lys 


Ser 


Arg 


Arg 






1010 








1015 








1020 










Ala 


Leu 


Pro 


Phe 


Ser 


Glu 


Lys 


Lys 


Lys 


Lys 


Lys 


Gin 


Lys 


Ala 







1025 1030 1035 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 84 9 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
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Pin 
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9^ 
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Pin 
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Tyr 
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OtJ 
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bxy 


x xe 


pi 

bxy 

JJ 


Lys 


Tl Q 

xxe 


Phe 
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bU 


Asn 


vai 


Cys 


Pity 
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Thr 


Met 


Pro 


Val 


Cys 


Gly 
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i nr 


xxe 


T «n 

Xieu 


Gly Lys 
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p T - , 

bxy 


Cys 
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Glu 


Ala 


GXu 


Thr 


GXn 
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Thr 


GXu 
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Leu 


Lys 
90 


Asn 


Trp 


Lys 


Pro 


TV T _ 

Axa 
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vai 
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GXu 
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bxy 
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105 
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Tyr 
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Leu 
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Ala 
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ax a 
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ax a 
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Phe 
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Val 
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Val 


Cys 


Asn 


Asp 


Asp 


Val 


Tyr 


Ser 


GXy 


Pro 


Leu 




130 










135 










140 










1 


Lys 


Phe 
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P T — » 

GXn 


Asp 
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ixe 


Asp 


Ser 
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Gin 


(jXy 
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Arg 


Asn 
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1 nr 




1 4 o 










XoU 










155 










160 




Tyr 


Phe 


Glu 


Phe 


Glu 


Thr 


Ala 


Leu 


Ala 


Cys 


Val 


Pro 


Ser 


Pro 


Val 


Asp 












165 










170 










175 






Cys 


Gin 


Val 


Thr 


Asp 


Leu 


Ala 


Gly Asn 


Glu 


Tyr 


Asp 


Leu 


Thr 


Gly 


Leu 


\ 3 
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XoU 










185 










x y u 








Ser 


Thr 


Val 


Arg 


Lys 


Pro 


Trp 


Thr 


Ala 


Val 


Asp 


l nr 


Ser 


vax 


Asp 


Gly 
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200 










205 








22a, 


Arg 


Lys 


Arg 


Thr 


rne 


Tyr 


Leu 


Ser 


Val 


Cys 


Asn 


Pro 


Leu 


Pro 


Tyr 


He 


o 




OTA 
ZXU 










Z X D 










9 9 n 












Pro 




Cys 


bxn 


bxy 


O n v- 

oer 


ax a 


Val 
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Ser 


Cys 


Leu 


Val 


Ser 


Glu 


Gly 
























235 










240 
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oer 


Trp 


Asn 
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pi 

biy 
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-Gin 


Met 
250 
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rlO 


Gin 


Ala 


Ala 
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Ala 




Asn 


pi i/ 


oer 


Leu 


oer 


x x e 


rJcL 


Tyr 


Val 


Asn 


Gly 
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Lys 


Cys 


Gly Asn 










260 










265 










270 








Pin 

bin 


Arg 


rne 
97 ^ 


Q £i r- 

oei 


Thr* 
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Thr 
280 


Phe 


Glu 


Cys 


A 1 ^ 


Gin 
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He 
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Gly 
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Pro 
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Aia 


rne 


Lixn 


Leu 


Pin 
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Asp Gly 


Cys 


Glu 


T* \7 V 

iyr 


Val 


Phe 


He 


Trp 






290 










295 










300 












Arg 


Thr 


Val 


Glu 


Ala 


Cys 


Pro 


Val 


Val 


Arg 


Val 


Glu 


Gly Asp 


Asn 


Cys 




305 










310 










315 










320 




Glu 


Val 


Lys 


Asp 


Pro 


Arg 


His 


Gly Asn 


Leu 


Tyr 


Asp 


Leu 


Lys 


Pro 


Leu 












325 










330 










335 






Gly 


Leu 


Asn 


Asp 


Thr 


He 


Val 


Ser 


Ala 


Gly Glu 


Tyr 


Thr 


Tyr 


Tyr 


Phe 










340 










345 










350 








Arg 


Val 


Cys 


Gly 


Lys 


Leu 


Ser 


Ser Asp 


Val 


Cys 


Pro 


Thr 


Ser 


Asp 


Lys 








355 










360 










365 










Ser 


Lys 
370 


Val 


Val 


Ser 


Ser 


Cys 
375 


Gin 


Glu 


Lys 


Arg 


Glu 
380 


Pro 


Gin 


Gly 


Phe 




His 


Lys 


Val 


Ala 


Gly 


Leu 


Leu 


Thr 


Gin 


Lys 


Leu 


Thr 


Tyr 


Glu 


Asn 


Gly 




385 










390 










395 










400 
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Leu 


Leu 


Lys 


Met 


Asn 
405 


Phe 


Thr 


Gly 


Gly 


Asp 
410 


Thr 


Cys 


His 


Lys 


Val 
415 


Tyr 




Gin 


Arg 


Ser 


Thr 
420 


Ala 


He 


Phe 


Phe 


Tvr 
425 


Cys 


Asp 


Arg 


Glv 


Thr 
430 


Gin 


Ara 




Pro 


Val 


Phe 
4 35 


Leu 


Lys 


Glu 


Thr 


Ser 
4 40 


Asp 


Cys 


Ser 


Tyr 


Leu 
445 


Phe 


Glu 


Trp 




Arg 


Thr 
450 


Gin 


Tyr 


Ala 


Cys 


Pro 
455 


Pro 


Phe 


Asp 


Leu 


Thr 
460 


Glu 


Cys 


Ser 


Phe 




Lys 


Asp 


Gly 


Ala 


Gly Asn 


Ser 


Phe 


Asp 


Leu 


Ser 


Ser 


Leu 


Ser 


Arg 


Tyr 




i \j .j 










a 7 n 










475 










480 




Ser 


Asp 


Asn 


Trp 


Pin 


Aia 


He 


Thr 


Gly 


Thr 
4 90 


Gly 


Asp 


Pro 


Glu 


His 
495 


Tyr 




Leu 


He 


Asn 


Val 
500 


Cys 


Lys 


Ser 


Leu 


Ala 
505 


Pro 


Gin 


Ala 


Glv 


Thr 
510 


Glu 


Pro 




Lyb 




Pr*r\ 
IT X. 

SIS 
j l j 




Ala 


Alt 

Ala 


7\ 1 a 
r\J. cl 


L^_y o 
S90 


Leu 


lieu 


uiy 


u j. y 


Cpr 

OCI 

525 


J-J y o 


Prn 


val 






T.on 

s^o 

J O VJ 


O X ^ 


Arg 


Val 


Arg 


Dor-) 

S*}S 

J J J 


Gly 


Pro 


Gin 




Arg 
540 


Asp 


Gly 


He 


He 


o 


V Ct J. 


Leu 




xyr 


Val 


Asp 




A on 
no 


T .on 

XjC Ll 




Pro 


Asp 


vjxy 


X X c 


Arg 


Lys 


S/l S 

J fi 0 










550 










sss 

JJJ 














Lys 


OCI 


i nir 


Th r- 

i nx7 


He 


Arg 


rile 


Th r* 




C v- 




OCX 


r; 1 n 

LJX 1 1 


Ua 1 
V Ct X 


7\ O T~l 

rto 1 1 














565 










S7 n 










S7 S 




3 ' J 


7\ y~rr 
r\±- y 


Pro 


l IG l. 




He 


Ser 


AT ^ 


1 

v a _L 


ul Ll 


Acn 




OX Ll 


T ur 
xyx 


Th r 

1 1 1 X 


Phe 


Ala 


M 

5 . : 

yy 








sft n 










Sft s 

JO J 
















irp 


Prn 


Thr 




Thr 


Ala 


^y o 




ric u 


Xjy a 


OCX 


Sen 
no 1 1 


L3X Ll 


n x o 


Asp 


Asp 


m 






SQS 










con 










60S 
O VJ .J 












UJLll 


V d -L 


1 1 IX. 


Asn 


Pro 


OCX. 


X 1 IX. 


LJX y 


n x o 


XjC Ll 


Phe 


Asp 


Leu 


Ser 


Ser 






0 X U 










61 s 










O \J 










a 


Leu 


Ser 


Gly 


Arg 


Ala 


Gly 


Phe 


Thr 


Ala 


Ala 


Tyr 


Ser 


Glu 


Lys 


Gly 


Leu 














630 










O O .J 










640 




Val 


Tyr 


Met 


Ser 


He 


Cys 


Gly 


Glu 


Asn 


Glu 


Asn 


Cys 


Pro 


Pro 


Gly Val 


M= 




















6S0 










655 






uiy 


a l ^ 




Pho 


Gly 


Gin 


X 11X 


x-ix y 


Tip 

X JL C 




V Ct X 


oxy 


J_i y o 


r\±. cl 


Asn 


Lys 










D DU 










DOJ 










D / U 






0 


Arg 


Leu 


Arg 

£7 S 
O / O 


Tyr 


Val 


Asp 


Gin 


Val 

DOU 


Leu 


Gin 


Leu 


Val 


Tyr 


Lys 


Asp 


Gly 




Ser 


Pro 

6Q0 


Cys 


Pro 


Ser 


Lys 


Ser 


Gly 


Leu 


Ser 


Tyr 


Lys 
700 


Ser 


Val 


He 


Ser 




Phe 


Val 


Cys 


Arg 


IT X O 


Glu 


Ala 


Gly 


Pro 


Thr 


Asn 


Arg 


Pro 


Met 


Leu 


He 




70S 










710 










71 S 

/ X J 










720 




Ser 


Leu 


Asp 


Lys 


bin 
725 


Thr 


Cys 


Thr 


Leu 


Phe 

7^0 


Phe 


Ser 


Tro 

X X p 


His 


Thr 
735 


Pro 




— LlC Ll 


r\X ct 


\_. ^ o 


\3 J. Ll 

7 4 0 


Gin 


Ala 


Thr 


L3X Ll 


74 5 


Ser 


Val 


Arg 


Asn 


Gly 
750 


Ser 


Ser 




He 


Val 


Asp 
7 SS 


Leu 


Ser 


Pro 


Leu 


He 
760 

/WW 


His 


Arg 


Thr 


Gly 


Gly 
7 65 


Tyr 


Glu 


Ala 




Tyr 


Asp 


Glu 


Ser 


Glu 


Asp Asp 


Ala 


Ser 


Asp 


Thr 


Asn 


Pro 


Asp 


Phe 


Tyr 






770 










775 










780 












He 


Asn 


He 


Cys 


Gin 


Pro 


Leu 


Asn 


Pro 


Met 


His 


Gly 


Val 


Pro 


Cys 


Pro 




TOC 










790 










7QS 










800 




Ala 


Gly 


Ala 


Ala 


Val 
805 


Cys 


Lys 


Val 


Pro 


He 
810 


Asp 


Gly 


Pro 


Pro 


He 
815 


Asp 




He 


Gly 


Arg 


Val 
820 


Ala 


Gly 


Pro 


Pro 


He 
825 


Leu 


Asn 


Pro 


He 


Ala 
830 


Asn 


Glu 




He 


Tyr 


Leu 
835 


Asn 


Phe 


Glu 


Ser 


Ser 
840 


Thr 


Pro 


Cys 


Gin 


Glu 
845 


Phe 


Ser 


Cys 



Lys 
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(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRI 





Met 


Ala 


Arg 


Leu 


Ser 


Arg 


Pro 




1 








5 








Glu 


Asp 


Leu 


Pro 


Tyr 


Glu 


Glu 










20 










Lys 


Cys 


Trp 


Leu 


His 


Tyr 


lie 








35 










u 


Arg 


Leu 


Asn 


Gin 


Leu 


Tyr 


Glu 


*Q 




50 










55 


S. : 


Tyr 


Lys 


Leu 


Trp 


Tyr 


Arg 


Tyr 


5^ 


65 










70 




=^ 


His 


Arg 


Cys 


Val 


Thr 


Asp 


Pro 












85 






hi 


Glu 


Arg 


Ala 


Phe 


Val 


Phe 


Met 


in 

y i 








100 








+= 


Tyr 


Cys 


Gin 


Phe 


Leu 


Met 


Asp 


3 






115 










N= 


Thr 


Phe 


Asp 


Arg 


Ala 


Leu 


Arg 


3 : 
3 




130 










135 


s , 
I s * 


He 


Trp 


Pro 


Leu 


Tyr 


Leu 


Arg 


5 « S 


145 










150 






' Thr 


Ala 


Val 


Arg 


Gly 


Tyr 


Arg 












165 








Ala 


Glu 


Glu 


Tyr 


He 


Glu 


Tyr 










loU 










Ala 


Ala 


Gin 


Arg 


Leu 


Ala 


Thr 








195 












Lys 


Ala 


Gly 


Lys 


Ser 


Asn 


Tyr 






210 










215 




He 


Ser 


Gin 


Asn 


Pro 


Asp 


Lys 




- 225 










230 






He 


Arg 


Gly 


Gly 


Leu 


Thr 


Arg 












245 








Cys 


Ser 


Leu 


Ala 


Asp 


Tyr 


Tyr 










260 










Arg 


Asp 


Val 


Tyr 


Glu 


Glu 


Ala 








275 












Phe 


Thr 


Gin 


Val 


Phe 


Asp 


Ser 






290 










295 




Ala 


Ala 


Lys 


Met 


Glu 


Thr 


Ala 




305 










310 






Asp 


Val 


Asp 


Leu 


Glu 


Leu 


Arg 












325 








Arg 


Arg 


Pro 


Leu 


Leu 


Leu 


Asn 



340 



ION: SEQ ID NO: 153: 



Glu Arg 


Pro 


Asp 


Leu 


Val 


Phe 


Glu 


Glu 






10 










15 




Glu 


He 


Met 


Arg 


Asn 


Gin 


Phe 


Ser 


Val 




25 










30 






Glu 


Phe 


Lys 


Gin 


Gly 


Ala 


Pro 


Lys 


Pro 


40 










45 








Arg 


Ala 


Leu 


Lys 


Leu 


Leu 


Pro 


Cys 


Ser 










60 










Leu 


Lys 


Ala 


Arg 


Arg 


Ala 


Gin 


Val 


Lys 








75 










80 


Ala 


Tyr 


Glu 


Asp 


Val 


Asn 


Asn 


Cys 


His 






90 










95 




His 


Lys 


Met 


Pro 


Arg 


Leu 


Trp 


Leu 


Asp 




105 










110 






Gin 


Gly 


Arg 


Val 


Thr 


His 


Thr 


Arg 


Arg 


120 










125 








Ala 


Leu 


Pro 


He 


Thr 


Gin 


His 


Ser 


Arg 










140 










Phe 


Leu 


Arg 


Ser 


His 


Pro 


Leu 


Pro 


Glu 








155 










160 


Arg 


Phe 


Leu 


Lys 


Leu 


Ser 


Pro 


Glu 


Ser 






170 










175 




Leu 


Lys 


Ser 


Ser 


Asp 


Arg 


Leu 


Asp 


Glu 




185 










190 






Val 


Val 


Asn 


Asp 


Glu 


Arg 


Phe 


Val 


Ser 


200 










205 








Gin 


Leu 


Trp 


His 


Glu 


Leu 


Cys 


Asp 


Leu 










220 










Val 


Gin 


Ser 


Leu 


Asn 


Val 


Asp 


Ala 


He 








235 










240 


Phe 


Thr 


Asp 


Gin 


Leu 


Gly 


Lys 


Leu 


Trp 






250 










255 




He Arg 


Ser 


Gly 


His 


Phe 


Glu 


Lys 


Ala 




265 










270 






He 


Arg 


Thr 


Val 


Met 


Thr 


Val 


Arg 


Asp 


280 










285 








Tyr 


Ala 


Gin 


Phe 


Glu 


Glu 


Ser 


Met 


He 










300 










Ser 


Glu 


Leu 


Gly 


Arg 


Glu 


Glu 


Glu 


Asp 








315 










320 


Leu 


Ala 


Arg 


Phe 


Glu 


Gin 


Leu 


He 


Ser 






330 










335 




Ser 


Val 


Leu 


Leu 


Arg 


Gin 


Asn 


Pro 


His 




345 










350 







- 55 - 



4 



* 



d 



3 



His Val His Glu Trp His Lys Arg Val Ala Leu His Gin Gly Arg Pro 

355 360 365 

Arg Glu lie lie Asn Thr Tyr Thr Glu Ala Val Gin Thr Val Asp Pro 

370 375 380 

Phe Lys Ala Thr Gly Lys Pro His Thr Leu Trp Val Ala Phe Ala Lys 
385 390 395 400 

Phe Tyr Glu Asp Asn Gly Gin Leu Asp Asp Ala Arg Val lie Leu Glu 

405 410 415 

Lys Ala Thr Lys Val Asn Phe Lys Gin Val Asp Asp Leu Ala Ser Val 

420 425 430 

Trp Cys Gin Cys Gly Glu Leu Glu Leu Arg His Glu Asn Tyr Asp Glu 

435 440 445 

Ala Leu Arg Leu Leu Arg Lys Ala Thr Ala Leu Pro Ala Arg Arg Ala 

450 455 460 

Glu Tyr Phe Asp Gly Ser Glu Pro Val Gin Asn Arg Val Tyr Lys Ser 
465 470 475 480 

Leu Lys Val Trp Ser Met Leu Ala Asp Leu Glu Glu Ser Leu Gly Thr 

485 490 495 

Phe Gin Ser Thr Lys Ala Val Tyr Asp Arg lie Leu Asp Leu Arg He 

500 505 510 

Ala Thr Pro Gin He Val He Asn Tyr Ala Met Phe Leu Glu Glu His 

515 520 525 

Lys Tyr Phe Glu Glu Ser Phe Lys Ala Tyr Glu Arg Gly He Ser Leu 

530 535 540 

Phe Lys Trp Pro Asn Val Ser Asp He Trp Ser Thr Tyr Leu Thr Lys 
545 550 555 560 

Phe He Ala Arg Tyr Gly Gly Arg Lys Leu Glu Arg Ala Arg Asp Leu 

565 570 575 

Phe Glu Gin Ala Leu Asp Gly Cys Pro Pro Lys Tyr Ala Lys Thr Leu 

580 585 590 

Tyr Leu Leu Tyr Ala Gin Leu Glu Glu Glu Trp Gly Leu Ala Arg His 

595 600 605 

Ala Met Ala Val Tyr Glu Arg Ala Thr Arg Ala Val Glu Pro Ala Gin 

610 615 620 

Gin Tyr Asp Met Phe Asn He Tyr He Lys Arg Ala Ala Glu He Tyr 
625 630 635 640 

Gly Val Thr His Thr Arg Gly He Tyr Gin Lys Ala He Glu Val Leu 

645 650 655 

Ser Asp Glu His Ala Arg Glu Met Cys Leu Arg Phe Ala Asp Met Glu 

660 665 670 

Cys Lys Leu Gly Glu He Asp Arg Ala Arg Ala He Tyr Ser Phe Cys 

675 680 685 

Ser Gin He Cys Asp Pro Arg Thr Thr Gly Ala Phe Trp Gin Thr Trp 

690 695 700 

Lys Asp Phe Glu Val Arg His Gly Asn Glu Asp Thr He Lys Glu Met 
705 710 715 720 

Leu Arg He Arg Arg Ser Val Gin Ala Thr Tyr Asn Thr Gin Val Asn 

725 730 735 

Phe Met Ala Ser Gin Met Leu Lys Val Ser Gly Ser Ala Thr Gly Thr 

740 745 750 

Val Ser Asp Leu Ala Pro Gly Gin Ser Gly Met Asp Asp Met Lys Leu 

755 760 765 

Leu Glu Gin Arg Ala Glu Gin Leu Ala Ala Glu Ala Glu Arg Asp Gin 

770 775 780 

Pro Leu Arg Ala Gin Ser Lys He Leu Phe Val Arg Ser Asp Ala Ser 
785 790 795 800 

Arg Glu Glu Leu Ala Glu Leu Ala Gin Gin Val Asn Pro Glu Glu He 
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4 



805 810 815 

Gin Leu Gly Glu Asp Glu Asp Glu Asp Glu Met Asp Leu Glu Pro Asn 

820 825 830 

Glu Val Arg Leu Glu Gin Gin Ser Val Pro Ala Ala Val Phe Gly Ser 

835 840 845 

Leu Lys Glu Asp 
850 

(2) INFORMATION FOR SEQ ID NO: 154: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 693 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 



a i a 
3 s 

w 



Met 


Phe 


Ser 


Ala 


Leu 


Lys 


Lys 


Leu 


Val 


Gly 


Ser 


Asp 


Gin 


Ala 


Pro 


Gly 


1 








5 










10 










15 




Arg 


Asp 


Lys 


Asn 


He 


Pro 


Ala 


Gly 


Leu 


Gin 


Ser 


Met 


Asn 


Gin 


Ala 


Leu 








20 










25 










30 






Gin 


Arg 


Arg 


Phe 


Ala 


Lys 


Gly 


Val 


Gin 


Tyr 


Asn 


Met 


Lys 


He 


Val 


He 






35 










40 










45 








Arg 


Gly 


Asp 


Arg 


Asn 


Thr 


Gly 


Lys 


Thr 


Ala 


Leu 


Trp 


His 


Arg 


Leu 


Gin 




50 










55 










60 










Gly 


Arg 


Pro 


Phe 


Val 


Glu 


Glu 


Tyr 


He 


Pro 


Thr 


Gin 


Glu 


He 


Gin 


Val 


65 










70 










75 










80 


Thr 


Ser 


He 


His 


Trp 


Ser 


Tyr 


Lys 


Thr 


Thr 


Asp Asp 


He 


Val 


Lys 


Val 










85 










90 










95 




Glu 


Val 


Trp 


Asp 


Val 


Val 


Asp 


Lys 


Gly 


Lys 


Cys 


Lys 


Lys 


Arg 


Gly Asp 








100 










105 










110 






Gly 


Leu 


Lys 


Met 


Glu 


Asn 


Asp 


Pro 


Gin 


Glu 


Xaa 


Glu 


Ser 


Glu 


Met 


Ala 






115 










120 










125 








Leu 


Asp 


Ala 


Glu 


Phe 


Leu 


Asp 


Val 


Tyr 


Lys 


Asn 


Cys 


Asn 


Gly 


Val 


Val 




130 










135 










140 










Met 


Met 


Phe 


Asp 


He 


Thr 


Lys 


Gin 


Trp 


Thr 


Phe 


Asn 


Tyr 


He 


Leu 


Arg 


145 










150 










155 










160 


Glu 


Leu 


Pro 


Lys 


Val 


Pro 


Thr 


His 


Val 


Pro 


Val 


Cys 


Val 


Leu 


Gly Asn 










165 










170 










175 




Tyr 


Arg 


Asp 


Met 


Gly 


Glu 


His 


Arg 


Val 


He 


Leu 


Pro 


Asp 


Asp 


Val 


Arg 








180 










185 










190 






Asp 


Phe 


He 


Asp 


Asn 


Leu 


Asp 


Arg 


Pro 


Pro 


Gly 


Ser 


Ser 


Tyr 


Phe 


Arg 






195 










200 










205 








Tyr 


Ala 


Glu 


Ser 


Ser 


Met 


Lys 


Asn 


Ser 


Phe 


Gly Leu 


Lys 


Tyr 


Leu 


His 




210 










215 










220 










Lys 


Phe 


Phe 


Asn 


He 


Pro 


Phe 


Leu 


Gin 


Leu 


Gin 


Arg 


Glu 


Thr 


Leu 


Leu 


225 










230 










235 










240 


Arg 


Gin 


Leu 


Glu 


Thr 


Asn 


Gin 


Leu 


Asp 


Met 


Asp 


Ala 


Thr 


Leu 


Glu 


Glu 










245 










250 










255 




Leu 


Ser 


Val 


Gin 


Gin 


Glu 


Thr 


Glu 


Asp 


Gin 


Asn 


Tyr 


Gly 


He 


Phe 


Leu 








260 










265 










270 






Glu 


Met 


Met 


Glu 


Ala 


Arg 


Ser 


Arg 


Gly 


His 


Ala 


Ser 


Pro 


Leu 


Ala 


Ala 






275 










280 










285 








Asn 


Gly 


Gin 


Ser 


Pro 


Ser 


Pro 


Gly 


Ser 


Gin 


Ser 


Pro 


Val 


Leu 


Pro 


Ala 




290 










295 










300 










Pro 


Ala 


Val 


Ser 


Thr 


Gly 


Ser 


Ser 


Ser 


Pro Gly 


Thr 


Pro 


Gin 


Pro 


Ala 
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m 



305 310 
Pro Gin Leu Pro Leu Asn Ala Ala 
325 

Pro Pro Ser Glu Ala Leu Pro Pro 
340 

Pro Arg Arg Ser lie lie Ser Arg 
355 360 
Glu Ala Ala Pro Pro Pro Pro Glu 

370 375 
Ala Thr Val Gin Ser Val Glu Asp 
385 390 
Arg Ser Phe Leu Glu Asp Thr Thr 
405 

Gly Ala Lys Ala Ala Gin Gin Asp 
420 

Gly Gly Asn Pro Met Val Ala Gly 
435 440 
k Asp Gin Pro Arg Gly Ser Pro Pro 
^ 450 455 

Gin Asp lie Thr Leu Ser Ser Glu 
465 470 
Thr Lys Gly Pro Ala Pro Ala Pro 
485 

Lys Trp Ser Ser lie Pro Ala Ser 
500 

Thr Arg Thr Ala Ala Pro Pro Trp 
515 520 
Gly Pro Glu Lys Arg Ser Ser Thr 

530 535 
Gly Lys Gly Glu Gin Ala Ser Ser 
545 550 
He Ala Ala Gin Met Leu Ser Phe 
565 

Ser Glu Gly Ser Asp Thr Gin Arg 
580 

Asp Asp Pro Ser Asp Val Thr Asp 
595 600 
Pro Pro Pro Pro Lys Leu Pro Leu 

610 615 
Ser Asp Leu Phe Gly Leu Gly Leu 
625 630 
Ser Glu Glu Gly Lys Glu Gly Lys 
645 

Lys Thr Lys Ser Phe Ser Arg Val 
660 

Arg Phe Ser Thr Arg Val Gly Tyr 
675 680 
Tyr Ser Glu Ser Tyr 
690 



315 

Pro Pro Ser Ser 
330 

Pro Ala Cys Pro 
345 

Leu Phe Gly Thr 

Pro Val Pro Ala 
380 

Phe Val Pro Asp 
395 

Pro Ala Arg Asp 
410 

Ser Asp Ser Asp 
425 

Phe Gin Asp Asp 

Leu Pro Ala Gly 
460 

Glu Glu Ala Glu 
475 

Gin Gin Cys Ser 
490 

Lys Pro Arg Arg 
505 

Pro Gly Gly Val 

Arg Pro Pro Ala 
540 

Ser Glu Ser Asp 
555 

Val Met Asp Asp 
570 

Arg Ala Asp Asp 
585 

Glu Asp Glu Gly 

Pro Ala Phe Arg 
620 

Glu Glu Ala Gly 
635 

Thr Pro Ser Lys 
650 

Leu Leu Glu Arg 
665 

Gin Val Ser Val 



320 

Val Pro Pro Val 
335 

Ser Ala Pro Ala 
350 

Ser Pro Ala Thr 
365 

Ala Gin Gly Pro 

Asp Arg Leu Asp 
400 

Glu Lys Lys Val 
415 

Gly Glu Ala Leu 
430 

Val Asp Leu Glu 
445 

Pro Val Pro Ser 

Val Ala Ala Pro 
480 

Glu Pro Glu Thr 
4 95 

Gly Thr Ala Pro 
510 

Ser Val Arg Thr 
525 

Glu Met Glu Pro 

Pro Glu Gly Pro 
560 

Pro Asp Phe Glu 
575 

Phe Pro Val Arg 
590 

Pro Ala Glu Pro 
605 

Leu Lys Asn Asp 

Pro Lys Glu Ser 
640 

Glu Lys Lys Lys 
655 

Pro Arg Ala His 
670 

Pro Asn Ser Pro 
685 
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